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(54) Tide: NON-ENDOGENOUS, CONSTITUTIVELY ACTIVATED HUMAN G PROTEIN-COUPLED RECEPTORS 



(57) Abstract 

Disclosed herein are constitutively activated, non-endogenous versions of endogenous human G protein-coupled receptors comprising 
(a) the following amino acid sequence region (C-terminus to N-terminus orientation) and/or (b) the following nucleic acid sequence 
region (3' to 5* orientation) transversing the transmembrane-6 (TM6) and intracellular loop-3 (IC3) regions of the GPCR: (a) P 1 AA15 
X and/or (b) pcod<» (AA-codon)is Xcodom respectively. In a most preferred embodiment, P 1 and P 00 * 0 " ^ endogenous proline and an 
endogenous nucleic acid encoding region encoding proline, respectively, located within TM6 of the non-endogenous GPCR; A A 15 and 
(AA-codon)is are 15 endogenous amino acid residues and 15 codons encoding endogenous amino acid residues, respectively; and X and 
Xcodoo are non-endogenous lysine and a non-endogenous nucleic acid encoding region encoding lysine, respectively, located within IC3 
of the non-endogenous GPCR. Because it is most preferred that the non-endogenous human GPCRs which incorporate these mutations 
are incorporated into mammalian cells and utilized for the screening of the candidate compounds, the non-endogenous human GPCR 
incorporating the mutation need not be purified and isolated per se (i.e., these are incorporated within the cellular membrane of a mammalian 
cell), although such purified and isolated non-endogenous human GPCRs are well within the purview of this disclosure. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Behan, Dominic P. 

Chalmers, Derek T. 
Liaw, Chen W. 

(ii) TITLE OF INVENTION: Non- Endogenous , Constitutively 

Activated Human G Protein- Coupled 
Orphan Receptors 

(iii) NUMBER OF SEQUENCES: 280 



(iv) ' CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Arena Pharmaceuticals, Inc. 

(B) STREET: 6166 Nancy Ridge Drive 

(C) CITY: San Diego 
15 (D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 92122 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

20 (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: US 

25 <B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Burgoon, Richard P. 

(B) REGISTRATION NUMBER: 34,787 

30 (ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (619)453-7200 

(B) TELEFAX: (619)453-7210 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1068 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

40 (xi) "SEQUENCE DESCRIPTION: SEQ ID NO:l: 



ATGGAAGATT TGGAGGAAAC ATTATTTGAA GAATTTGAAA ACTATTCCTA TGACCTAGAC 60 
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TATTACTCTC TGGAGTCTGA TTTGGAGGAG AAAGTCCAGC TGGGAGTTGT TCACTGGGTC 120 

TCCCTGGTGT TATATTGTTT GGCTTTTGTT CTGGGAATTC CAGGAAATGC CATCGTCATT 180 

TGGTTCACGG GGCTCAAGTG GAAGAAGACA GTCACCACTC TGTGGTTCCT CAATCTAGCC 240 

ATTGCGGATT TCATTTTTCT TCTCTTTCTG CCCCTGTACA TCTCCTATGT GGCCATGAAT 300 

5 TTCCACTGGC CCTTTGGCAT CTGGCTGTGC AAAGCCAATT CCTTCACTGC CCAGTTGAAC 360 

ATGTTTGCCA GTGTTTTTTT CCTGACAGTG ATCAGCCTGG ACCACTATAT CCACTTGATC 420 

CATCCTGTCT TATCTCATCG GCATCGAACC CTCAAGAACT CTCTGATTGT CATTATATTC 480 

ATCTGGCTTT TGGCTTCTCT AATTGGCGGT CCTGCCCTGT ACTTCCGGGA CAGTGTGGAG 540 

TTCAATAATC ATACTCTTTG CTATAACAAT TTTCAGAAGC ATGATCCTGA CCTCACTTTG 600 

10 ATCAGGCACC ATGTTCTGAC TTGGGTGAAA TTTATCATTG GCTATCTCTT CCCTTTGCTA 660 

ACAATGAGTA TTTGCTACTT GTGTCTCATC TTCAAGGTGA AGAAGCGAAC AGTCCTGATC 720 

TCCAGTAGGC ATTTCTGGAC AATTCTGGTT GTGGTTGTGG CCTTTGTGGT TTGCTGGACT 780 

CCTTATCACC TGTTTAGCAT TTGGGAGCTC ACCATTCACC ACAATAGCTA TTCCCAtlCAT 840 

GTGATGCAGG CTGGAATCCC CCTCTCCACT GGTTTGGCAT TCCTCAATAG TTGCTTGAAC 900 

15 CCCATCCTTT ATGTCCTAAT TAGTAAGAAG TTCCAAGCTC GCTTCCGGTC CTCAGTTGCT 960 

GAGATACTCA AGTACACACT GTGGGAAGTC AGCTGTTCTG GCACAGTGAG TGAACAGCTC 1020 

AGGAACTCAG AAACCAAGAA TCTGTGTCTC CTGGAAACAG CTCAATAA 1068 
(3) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 355 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Glu Asp Leu Glu Glu Thr Leu Phe Glu Glu Phe Glu Asn Tyr Ser 
15 10 15 

Tyr Asp Leu Asp Tyr Tyr Ser Leu Glu Ser Asp Leu Glu Glu Lys Val 
20 25 30 

30 Gin Leu Gly Val Val His Trp Val Ser Leu Val Leu Tyr Cys Leu Ala 

35 40 45 
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10 



15 



20 



25 



30 



35 



Phe Val Leu Gly He Pro Gly Asn Ala He Val He Trp Phe Thr 
50 55 60 



Gly 



Leu Lys Trp Lys Lys Thr Val Thr Thr Leu Trp Phe Leu Asn Leu Ala 
" 70 75 80 

lie Ala Asp Phe lie Phe Leu Leu Phe Leu Pro Leu Tyr lie Ser Tyr 
85 



90 



95 



Val Ala Met Asn Phe His Trp Pro Phe Gly lie Trp Leu Cys Lys Ala 
100 105 



110 



Asn Ser Phe Thr Ala Gin Leu Asn Met Phe Ala Ser Val Phe Phe 



115 



120 



Leu 



125 



Thr Val He Ser Leu Asp His Tyr lie His Leu lie His Pro Val Leu 



130 



135 



140 



Ser His Arg His Arg Thr Leu Lys Ash Ser Leu He Val He He Phe 



145 



150 



155 



160 



He Trp Leu Leu Ala Ser Leu He Gly Gly Pro Ala Leu Tyr Phe Arg 
165 170 



175 



Asp Thr Val Glu Phe Asn Asn His Thr Leu Cys Tyr Asn Asn Phe Gin 
180 185 19Q 

Lys His Asp Pro Asp Leu Thr Leu He Arg His His Val Leu Thr Trp 



200 



205 



Val Lys Phe He He Gly Tyr Leu Phe Pro Leu Leu Thr Met Ser He 
210 215 



220 



Cys Tyr Leu Cys Leu He Phe Lys Val Lys Lys Arg Thr Val 



225 



230 



Leu He 



235 240 
Ser Ser Arg His Phe Trp Thr He Leu Val Val Val Val Ala Phe Val 



245 



250 255 
Val Cys Trp Thr Pro Tyr His Leu Phe Ser lie Trp Glu Leu Thr lie 



260 



265 270 
His His Asn Ser Tyr Ser His His Val Met Gin Ala Gly He Pro Leu 



275 



280 285 
Ser Thr Gly Leu Ala Phe Leu Asn Ser Cys. Leu Asn Pro He Leu Tyr 



295 3 0 o 
Val Leu lie Ser Lys Lys Phe Gin Ala Arg Phe Arg Ser Ser Val Ala 



310 



315 



320 



Glu He Leu Lys Tyr Thr Leu Trp Glu Val Ser Cys Ser Gly Thr Val 
325 330 



335 



Ser Glu Gin Leu Arg Asn Ser Glu Thr Lys Asn Leu Cys Leu Leu 



Glu 
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340 

Thr Ala Gin 
355 

(4) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1089 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

ATGGGCAACC ACACGTGGGA GGGCTGCCAC GTGGACTCGC GCGTGGACCA CCTCTTTCCG 60 

CCATCCCTCT ACATCTTTGT CATCGGCGTG GGGCTGCCCA CCAACTGCCT GGCTCTGTGG 120 

GCGGCCTACC GCCAGGTGCA ACAGCGCAAC GAGCTGGGCG TCTACCTGAT GAACCTCAGC 180 

ATCGCCGACC TGCTGTACAT CTGCACGCTG CCGCTGTGGG TGGACTACTT CCTGCACCAC 240 

GACAACTGGA TCCACGGCCC CGGGTCCTGC AAGCTCTTTG GGTTCATCTT CTACACCAAT 300 

ATCTACATCA GCATCGCCTT CCTGTGCTGC ATCTCGGTGG ACCGCTACCT GGCTGTGGCC 360 

CACCCACTCC GCTTCGCCCG CCTGCGCCGC GTCAAGACCG CCGTGGCCGT GAGCTCCGTG 420 

GTCTGGGCCA CGGAGCTGGG CGCCAACTCG GCGCCCCTGT TCCATGACGA GCTCTTCCGA 480 

GACCGCTACA ACCACACCTT CTGCTTTGAG AAGTTCCCCA TGGAAGGCTG GGTGGCCTGG 540 

ATGAACCTCT ATCGGGTGTT CGTGGGCTTC CTCTTCCCGT GGGCGCTCAT GCTGCTGTCG 600 

TACCGGGGCA TCCTGCGGGC CGTGCGGGGC AGCGTGTCCA CCGAGCGCCA GGAGAAGGCC 660 

AAGATCAAGC GGCTGGCCCT CAGCCTCATC GCCATCGTGC TGGTCTGCTT TGCGCCCTAT 720 

CACGTGCTCT TGCTGTCCCG CAGCGCCATC TACCTGGGCC GCCCCTGGGA CTGCGGCTTC 780 

GAGGAGCGCG TCTTTTCTGC ATACCACAGC TCACTGGCTT TCACCAGCCT CAACTGTGTG 840 

GCGGACCCCA TCCTCTACTG CCTGGTCAAC GAGGGCGCCC GCAGCGATGT GGCCAAGGCC 900 

CTGCACAACC TGCTCCGCTT TCTGGCCAGC GACAAGCCCC AGGAGATGGC CAATGCCTCG 960 

CTCACCCTGG AGACCCCACT CACCTCCAAG AGGAACAGCA CAGCCAAAGC CATGACTGGC 1020 

AGCTGGGCGG CCACTCCGCC TTCCCAGGGG GACCAGGTGC AGCTGAAGAT GCTGCCGCCA 1080 

GCACAATGA 1089 



345 



350 
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) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Gly Asn His Thr Trp Glu Gly Cys His Val Asp Ser Arg Val Asp 
1 5 10 15 

His Leu Phe Pro Pro Ser Leu Tyr He Phe Val He Gly Val Gly Leu 
20 25 30 

Pro Thr Asn Cys Leu Ala Leu Trp Ala Ala Tyr Arg Gin Val Gin Gin 
35 40 45 

Arg Asn Glu Leu Gly Val Tyr Leu Met Asn Leu Ser He Ala Asp Leu 
50 55 60 

Leu Tyr He Cys Thr Leu Pro Leu Trp Val Asp Tyr Phe Leu His His 
65 70 75 80 

Asp Asn Trp He His Gly Pro Gly Ser Cys Lys Leu Phe Gly Phe He 
85 90 95 

Phe Tyr Thr Asn He Tyr He Ser He Ala Phe Leu Cys Cys He Ser 
100 105 no 

Val Asp Arg Tyr Leu Ala Val Ala His Pro Leu Arg Phe Ala Arg Leu 
115 120 125 

Arg Arg Val Lys Thr Ala Val Ala Val Ser Ser Val Val Trp Ala Thr 
130 135 140 

Glu Leu Gly Ala Asn Ser Ala Pro Leu Phe His Asp Glu Leu Phe Arg 
145 150 155 160 

Asp Arg Tyr Asn His Thr Phe Cys Phe Glu Lys Phe Pro Met Glu Gly 



Pro Trp Ala Leu Met Leu Leu Ser Tyr Arg Gly lie Leu Arg Ala Val 
195 200 205 

Arg Gly Ser Val Ser Thr Glu Arg Gin Glu Lys Ala Lys He Lys Arg 
210 215 220 

Leu Ala Leu Ser Leu He Ala He Val Leu Val Cys Phe Ala Pro Tyr 



Trp Val Ala Trp Met Asn Leu Tyr Arg Val Phe Val Gly Phe Leu Phe 
180 185 190 



WO 00/22129 



PCT/US99/23938 



6 

225 230 

His Val Leu Leu Leu Ser 
245 

Asp Cys Gly Phe Glu Glu 
260 

Ala Phe Thr Ser Leu Asn 
275 

Val Asn Glu Gly Ala Arg 
290 

Leu Arg Phe Leu Ala Ser 
305 310 

Leu Thr Leu Glu Thr Pro 
325 

Ala Met Thr Gly Ser Trp 
340 

Val Gin Leu Lys Met Leu 
355 

(6) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
TATGAATTCA GATGCTCTAA ACGTCCCTGC 30 

(7) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TCCGGATCCA CCTGCACCTG CGCCTGCACC 30 

(8) INFORMATION FOR SEQ ID NO : 7: 



235 



240 



Arg Ser Ala lie Tyr Leu Gly Arg Pro Trp 
250 255 

Arg Val Phe Ser Ala Tyr His Ser Ser Leu 
265 270 

Cys Val Ala Asp Pro lie Leu Tyr Cys Leu 
280 285 

Ser Asp Val Ala Lys Ala Leu His Asn Leu 
295 300 

Asp Lys Pro Gin Glu Met Ala Asn Ala Ser 
315 320 

Leu Thr Ser Lys Arg Asn Ser Thr Ala Lys 
330 335 

Ala Ala Thr Pro Pro Ser Gin Gly Asp Gin 
345 350 



Pro Pro Ala Gin 
360 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1002 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 7: 

ATGGAGTCCT CAGGCAACCC AGAGAGCACC ACCTTTTTTT ACTATGACCT TCAGAGCCAG 60 

CCGTGTGAGA ACCAGGCCTG GGTCTTTGCT ACCCTCGCCA CCACTGTCCT GTACTGCCTG 120 

10 GTGTTTCTCC TCAGCCTAGT GGGCAACAGC CTGGTCCTGT GGGTCCTGGT GAAGTATGAG 180 

AGCCTGGAGT CCCTCACCAA CATCTTCATC CTCAACCTGT GCCTCTCAGA CCTGGTGTTC 240 

GCCTGCTTGT TGCCTGTGTG GATCTCCCCA TACCACTGGG GCTGGGTGCT GGGAGACTTC 300 

CTCTGCAAAC TCCTCAATAT GATCTTCTCC ATCAGCCTCT ACAGCAGCAT CTTCTTCCTG 360 

ACCATCATGA CCATCCACCG CTACCTGTCG GTAGTGAGCC CCCTCTCCAC CCTGCGCGTC 420 

15 CCCACCCTCC GCTGCCGGGT GCTGGTGACC ATGGCTGTGT GGGTAGCCAG CATCCTGTCC 480 

TCCATCCTCG ACACCATCTT CCACAAGGTG CTTTCTTCGG GCTGTGATTA TTCCGAACTC 540 

ACGTGGTACC TCACCTCCGT CTACCAGCAC AACCTCTTCT TCCTGCTGTC CCTGGGGATT 600 

ATCCTGTTCT GCTACGTGGA GATCCTCAGG ACCCTGTTCC GCTCACGCTC CAAGCGGCGC 660 

CACCGCACGG TCAAGCTCAT CTTCGCCATC GTGGTGGCCT ACTTCCTCAG CTGGGGTCCC 720 

20 TACAACTTCA CCCTGTTTCT GCAGACGCTG TTTCGGACCC AGATCATCCG GAGCTGCGAG 780 

GCCAAACAGC AGCTAGAATA CGCCCTGCTC ATCTGCCGCA ACCTCGCCTT CTCCCACTGC 840 

TGCTTTAACC CGGTGCTCTA TGTCTTCGTG GGGGTCAAGT TCCGCACACA CCTGAAACAT 900 

GTTCTCCGGC AGTTCTGGTT CTGCCGGCTG CAGGCACCCA GCCCAGCCTC GATCCCCCAC 960 

TCCCCTGGTG CCTTCGCCTA TGAGGGCGCC TCCTTCTACT GA 1002 

25 (9) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

30 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
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Met Glu Ser Ser Gly Asn Pro Glu Ser Thr Thr Phe Phe Tyr Tyr Asp 
1 5 10 15 

Leu Gin Ser Gin Pro Cys Glu Asn Gin Ala Trp Val Phe Ala Thr Lou 
20 25 30 

Ala Thr Thr Val Leu Tyr Cys Leu Val Phe Leu Leu Ser Leu Val Gly 
35 40 45 

Asn Ser Leu Val Leu Trp Val Leu Val Lys Tyr Glu Ser Leu Glu Ser 
50 55 60 

Leu Thr Asn He Phe He Leu Asn Leu Cys Leu Ser Asp Leu Val Phe 
65 70 75 80 

Ala Cys Leu Leu Pro Val Trp He Ser Pro Tyr His Trp Gly Trp Val 
85 90 95 

Leu Gly Asp Phe Leu Cys Lys Leu Leu Asn Met He Phe Ser He Ser 
100 105 no 

Leu Tyr Ser Ser He Phe Phe Leu Thr He Met Thr He His Arg Tyr 
115 120 125 

Leu Ser Val Val Ser Pro Leu Ser Thr Leu Arg Val Pro Thr Leu Arg 
130 135 140 

Cys Arg Val Leu Val Thr Met Ala Val Trp Val Ala Ser He Leu Ser 
145 150 155 160 

Ser He Leu Asp Thr He Phe His Lys Val Leu Ser Ser Gly Cys Asp 
165 170 175 

Tyr Ser Glu Leu Thr Trp Tyr Leu Thr Ser Val Tyr Gin His Asn Leu 
180 185 190 

Phe Phe Leu Leu Ser Leu Gly He He Leu Phe Cys Tyr Val Glu He 
195 200 205 

Leu Arg Thr Leu Phe Arg Ser Arg Ser Lys Arg Arg His Arg Thr Val 
210 215 220 

Lys Leu He Phe Ala He Val Val Ala Tyr Phe Leu Ser Trp Gly Pro 
225 230 235 " 240 

Tyr Asn Phe Thr Leu Phe Leu Gin Thr Leu Phe Arg Thr Gin He He 
245 250 255 

Arg Ser Cys Glu Ala Lys Gin Gin Leu Glu Tyr Ala Leu Leu He Cys 
260 265 270 

Arg Asn Leu Ala Phe Ser His Cys Cys Phe Asn Pro Val Leu Tyr Val 
275 280 285 
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Phe Val Gly Val Lys Phe Arg Thr His Leu Lys His Val Leu Arg Glh 
290 295 300 

Phe Trp Phe Cys Arg Leu Gin Ala Pro Ser Pro Ala Ser lie Pro His 
305 310 315 320 

5 Ser Pro Gly Ala Phe Ala Tyr Glu Gly Ala Ser Phe Tyr 

325 330 

(10) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 

10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
15 GCAAGCTTGG GGGACGCCAG GTCGCCGGCT 30 

(11) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GCGGATCCGG ACGCTGGGGG AGTCAGGCTG C 31 
25 (12) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 987 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ATGGACAACG CCTCGTTCTC GGAGCCCTGG CCCGCCAACG CATCGGGCCC GGACCCGGCG 60 
CTGAGCTGCT CCAACGCGTC GACTCTGGCG CCGCTGCCGG CGCCGCTGGC GGTGGCTGTA 120 



35 



CCAGTTGTCT ACGCGGTGAT CTGCGCCGTG GGTCTGGCGG GCAACTCCGC CGTGCTGTAC 180 
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GTGTTGCTGC GGGCGCCCCG CATGAAGACC GTCACCAACC TGTTCATCCT CAACCTGGCC 
ATCGCCGACG AGCTCTTCAC GCTGGTGCTG CCCATCAACA TCGCCGACTT CCTGCTGCGG 
CAGTGGCCCT TCGGGG*GCT CATGTGCAAG CTCATCGTGG CTATCGACCA GTACAACACC 
TTCTCCAGCC TCTACTTCCT CACCGTCATG AGCGCCGACC GCTACCTGGT GGTGTTGGCC 
5 ACTGCGGAGT CGCGCCGGGT GGCCGGCCGC ACCTACAGCG CCGCGCGCGC GGTGAGCCTG 
GCCGTGTGGG GGATCGTCAC ACTCGTCGTG CTGCCCTTCG CAGTCTTCGC CCGGCTAGAC 
GACGAGCAGG GCCGGCGCCA GTGCGTGCTA GTCTTTCCGC AGCCCGAGGC CTTCTGGTGG 
CGCGCGAGCC GCCTCTACAC GCTCGTGCTG GGCTTCGCCA TCCCCGTGTC CACCATCTGT 
GTCCTCTATA CCACCCTGCT GTGCCGGCTG CATGCCATGC GGCTGGACAG CCACGCCAAG 

10 GCCCTGGAGC GCGCCAAGAA GCGGGTGACC TTCCTGGTGG TGGCAATCCT GGCGGTGTGC 
CTCCTCTGCT GGACGCCCTA CCACCTGAGC ACCGTGGTGG CGCTCACCAC CGACCTCCCG 
CAGACGCCGC TGGTCATCGC TATCTCCTAC TTCATCACCA GCCTGACGTA CGCCAACAGC 
TGCCTCAACC CCTTCCTCTA CGCCTTCCTG GACGCCAGCT TCCGCAGGAA CCTCCGCCAG 
CTGATAACTT GCCGCGCGGC AGCCTGA 

15 (13) INFORMATION FOR SEQ ID NO: 12: 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
987 



20 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 328 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Met Asp Asn Ala Ser Phe Ser Glu Pro Trp Pro Ala Asn Ala Ser Gly 
15 10 15 



25 



Pro Asp Pro Ala Leu Ser Cys Ser Asn Ala Ser Thr Leu Ala Pro Leu 
20 25 30 



Pro Ala Pro Leu Ala Val Ala Val Pro Val Val Tyr Ala Val He Cys 
35 40 45 



30 



Ala Val Gly Leu Ala Gly Asn Ser Ala Val Leu Tyr Val Leu Leu Arg 
50 55 60 



Ala Pro Arg Met Lys Thr Val Thr Asn Leu Phe He Leu Asn Leu Ala 
65 70 75 80 
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lie Ala Asp Glu Leu Phe Thr Leu Val Leu Pro lie Asn He Ala Asp 
85 90 95 

Phe Leu Leu Arg Gin Trp Pro Phe Gly Glu Leu Met Cys Lys Leu He 
100 105 110 

5 Val Ala He Asp Gin Tyr Asn Thr Phe Ser Ser Leu Tyr Phe lieu Thr 

115 120 125 

Val Met Ser Ala Asp Arg Tyr Leu Val Val Leu Ala Thr Ala Glu Ser 
130 135 140 

Arg Arg Val Ala Gly Arg Thr Tyr Ser Ala Ala Arg Ala Val Ser Leu 
10 145 150 155 160 

Ala Val Trp Gly He Val Thr Leu Val Val Leu Pro Phe Ala Val Phe 
165 170 175 

Ala Arg Leu Asp Asp Glu Gin Gly Arg Arg Gin Cys Val Leu Val Phe 
180 185 190 

15 Pro Gin Pro Glu Ala Phe Trp Trp Arg Ala Ser Arg Leu Tyr Thr Leu 

195 200 205 

Val Leu Gly Phe Ala He Pro Val Ser Thr He Cys Val Leu Tyr Thr 
210 215 220 

Thr Leu Leu Cys Arg Leu His Ala Met Arg Leu Asp Ser His Ala Lys 
20 225 230 235 240 

Ala Leu Glu Arg Ala Lys Lys Arg Val Thr Phe Leu Val Val Ala He 
245 250 255 

Leu Ala Val Cys Leu Leu Cys Trp Thr Pro Tyr His Leu Ser Thr Val 
260 265 270 

25 Val Ala Leu Thr Thr Asp Leu Pro Gin Thr Pro Leu Val He Ala He 

275 280 285 

Ser Tyr Phe He Thr Ser Leu Thr Tyr Ala Asn Ser Cys Leu Asn Pro 
290 295 300 

Phe Leu Tyr Ala Phe Leu Asp Ala Ser Phe Arg Arg Asn Leu Arg Gin 
30 305 310 315 320 

Leu He Thr Cys Arg Ala Ala Ala 
325 

(14) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CGGAATTCGT CAACGGTCCC AGCTACAATG 30 

(15) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

ATGGATCCCA GGCCCTTCAG CACCGCAATA T 31 

(16) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1002 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATGCAGGCCG CTGGGCACCC AGAGCCCCTT GACAGCAGGG GCTCCTTCTC 
ATGGGTGCCA ACGTCTCTCA GGACAATGGC ACTGGCCACA ATGCCACCTT 
CTGCCGTTCC TCTATGTGCT CCTGCCCGCC GTGTACTCCG GGATCTGTGC 
ACTGGCAACA CGGCCGTCAT CCTTGTAATC CTAAGGGCGC CCAAGATGAA 
AACGTGTTCA TCCTGAACCT GGCCGTCGCC GACGGGCTCT TCACGCTGGT 
AACATCGCGG AGCACCTGCT GCAGTACTGG CCCTTCGGGG AGCTGCTCTG 
CTGGCCGTCG ACCACTACAA CATCTTCTCC AGCATCTACT TCCTAGCCGT 
GACCGATACC TGGTGGTGCT GGCCACCGTG AGGTCCCGCC ACATGCCCTG 
CGGGGGGCGA AGGTCGCCAG CCTGTGTGTC TGGCTGGGCG TCACGGTCCT 
TTCTTCTCTT TCGCTGGCGT CTACAGCAAC GAGCTGCAGG TCCCAAGCTG 
TTCCCGTGGC CCGAGCGGGT CTGGTTCAAG GCCAGCCGTG TCTACACTTT 
TTCGTGCTGC CCGTGTGCAC CATCTGTGTG CTCTACACAG ACCTCCTGCG 



CCTCCCCACG 60 

CTCCGAGCCA 120 

TGTGGGGCTG 180 

GACGGTGACC 240 

ACTGCCCGTC 300 

CAAGCTGGTG 360 

GATGAGCGTG 420 

GCGCACCTAC 480 

GGTTCTGCCC 540 

TGGGCTGAGC 600 

GGTCCTGGGC 660 

CAGGCTGCGG 720 
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GCCGTGCGGC TCCGCTCTGG AGCCAAGGCT CTAGGCAAGG CCAGGCGGAA GGTGACCGTC 780 

CTGGTCCTCG TCGTGCTGGC CGTGTGCCTC CTCTGCTGGA CGCCCTTCCA CCTGGCCTCT 840 

GTCGTGGCCC TGACCACGGA CCTGCCCCAG ACCCCACTGG TCATCAGTAT GTCCTACGTC 900 

ATCACCAGCC TCACGTACGC CAACTCGTGC CTGAACCCCT TCCTCTACGC CTTTCTAGAT 960 



(17) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Gin Ala Ala Gly His Pro Glu Pro Leu Asp Ser Arg Gly Ser Phe 
1 5 10 15 

Ser Leu Pro Thr Met Gly Ala Asn Val Ser Gin Asp Asn Gly Thr Gly 
20 25 30 

His Asn Ala Thr Phe Ser Glu Pro Leu Pro Phe Leu Tyr Val Leu Leu 
35 40 45 

Pro Ala Val Tyr Ser Gly lie Cys Ala Val Gly Leu Thr Gly Asn Thr 
50 55 60 

Ala Val lie Leu Val lie Leu Arg Ala Pro Lys Met Lys Thr Val Thr 
65 70 75 80 

Asn Val Phe lie Leu Asn Leu Ala Val Ala Asp Gly Leu Phe Thr Leu 
85 90 95 

Val Leu Pro Val Asn lie Ala Glu His Leu Leu Gin Tyr Trp Pro Phe 
100 105 110 

Gly Glu Leu Leu Cys Lys Leu Val Leu Ala Val Asp His Tyr Asn lie 
115 120 125 

Phe Ser Ser lie Tyr Phe Leu Ala Val Met Ser Val Asp Arg Tyr Leu 
130 135 140 

Val Val Leu Ala Thr Val Arg Ser Arg His Met Pro Trp Arg Thr Tyr 
145 150 155 160 

Arg Gly Ala Lys Val Ala Ser Leu Cys Val Trp Leu Gly Val Thr Val 



GACAACTTCC GGAAGAACTT CCGCAGCATA TTGCGGTGCT GA 



1002 



165 



170 



175 
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Leu Val Leu Pro Phe Phe Ser Phe Ala Gly Val Tyr Ser Asn Glu Leu 
180 185 190 

Gin Val Pro Ser Cys Gly Leu Ser Phe Pro Trp Pro Glu Arg Val Trp 
195 200 205 

Phe Lys Ala Ser Arg Val Tyr Thr Leu Val Leu Gly Phe Val Leu Pro 
210 215 220 

Val Cys Thr lie Cys Val Leu Tyr Thr Asp Leu Leu Arg Arg Leu Arg 
225 230 235 240 

Ala Val Arg Leu Arg Ser Gly Ala Lys Ala Leu Gly Lys Ala Arg Arg 
245 250 255 

Lys Val Thr Val Leu Val Leu Val Val Leu Ala Val Cys Leu Leu Cys 
260 265 270 

Trp Thr Pro Phe His Leu Ala Ser Val Val Ala Leu Thr Thr Asp Leu 
275 280 285 

Pro Gin Thr Pro Leu Val lie Ser Met Ser Tyr Val lie Thr Ser Leu 
290 295 300 

Thr Tyr Ala Asn Ser Cys Leu Asn Pro Phe Leu Tyr Ala Phe Leu Asp 
305 310 315 320 

Asp Asn Phe Arg Lys Asn Phe Arg Ser lie Leu Arg Cys 
325 330 

(18) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ACGAATTCAG CCATGGTCCT TGAGGTGAGT GACCACCAAG TGCTAAAT 48 

(19) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
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GAGGATCCTG GAATGCGGGG AAGTCAG 



27 



10 



15 



20 



(20) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1107 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION? SEQ ID NO: 19: 

ATGGTCCTTG AGGTGAGTGA CCACCAAGTG CTAAATGACG CCGAGGTTGC CGCCCTCCTG 60 

GAGAACTTCA GCTCTTCCTA TGACTATGGA GAAAACGAGA GTGACTCGTG CTGTACCTCC 120 

CCGCCCTGCC CACAGGACTT CAGCCTGAAC TTCGACCGGG CCTTCCTGCC AGCCCTCTAC 180 

AGCCTCCTCT TTCTGCTGGG GCTGCTGGGC AACGGCGCGG TGGCAGCCGT GCTGCTGAGC 240 

CGGCGGACAG CCCTGAGCAG CACCGACACC TTCCTGCTCC ACCTAGCTGT AGCAGACACG 300 

CTGCTGGTGC TGACACTGCC GCTCTGGGCA GTGGACGCTG CCGTCCAGTG GGTCTTTGGC 360 

TCTGGCCTCT GCAAAGTGGC AGGTGCCCTC TTCAACATCA ACTTCTACGC AGGAGCCCTC 420 

CTGCTGGCCT GCATCAGCTT TGACCGCTAC CTGAACATAG TTCATGCCAC CCAGCTCTAC 480 

CGCCGGGGGC CCCCGGCCCG CGTGACCCTC ACCTGCCTGG CTGTCTGGGG GCTCTGCCTG 54 0 

CTTTTCGCCC TCCCAGACTT CATCTTCCTG TCGGCCCACC ACGACGAGCG CCTCAACGCC 600 

ACCCACTGCC AATACAACTT CCCACAGGTG GGCCGCACGG CTCTGCGGGT GCTGCAGCTG 660 

GTGGCTGGCT TTCTGCTGCC CCTGCTGGTC ATGGCCTACT GCTATGCCCA CATCCTGGCC 720 

GTGCTGCTGG TTTCCAGGGG CCAGCGGCGC CTGCGGGCCA TGCGGCTGGT GGTGGTGGTC 780 

GTGGTGGCCT TTGCCCTCTG CTGGACCCCC TATCACCTGG TGGTGCTGGT GGACATCCTC 840 

ATGGACCTGG GCGCTTTGGC CCGCAACTGT GGCCGAGAAA GCAGGGTAGA CGTGGCCAAG 900 

TCGGTCACCT CAGGCCTGGG CTACATGCAC TGCTGCCTCA ACCCGCTGCT CTATGCCTTT 960 

GTAGGGGTCA AGTTCCGGGA GCGGATGTGG ATGCTGCTCT TGCGCCTGGG CTGCCCCAAC 1020 

CAGAGAGGGC TCCAGAGGCA GCCATCGTCT TCCCGCCGGG ATTCATCCTG GTCTGAGACC 1080 

TCAGAGGCCT CCTACTCGGG CTTGTGA 1107 
(21) INFORMATION FOR SEQ ID NO: 20: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 368 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Val Leu Glu Val Ser Asp His Gin Val Leu Asn Asp Ala Glu Val 
15 10 15 

Ala Ala Leu Leu Glu Asn Phe Ser Ser Ser Tyr Asp Tyr Gly Glu Asn 
20 25 30 

Glu Ser Asp Ser Cys Cys Thr Ser Pro Pro Cys Pro Gin Asp Phe Ser 
35 40 45 

Leu Asn Phe Asp Arg Ala Phe Leu Pro Ala Leu Tyr Ser Leu Leu Phe 
50 55 60 

Leu Leu Gly Leu Leu Gly Asn Gly Ala Val Ala Ala Val Leu Leu Ser 
65 70 75 80 

Arg Arg Thr Ala Leu Ser Ser Thr Asp Thr Phe Leu Leu His Leu Ala 
85 90 95 

Val Ala Asp Thr Leu Leu Val Leu Thr Leu Pro Leu Trp Ala Val Asp 
100 105 110 

Ala Ala Val Gin Trp Val Phe Gly Ser Gly Leu Cys Lys Val Ala Gly 
115 120 125 

Ala Leu Phe Asn lie Asn Phe Tyr Ala Gly Ala Leu Leu Leu Ala Cys 
130 135 140 

lie Ser Phe Asp Arg Tyr Leu Asn He Val His Ala Thr Gin Leu Tyr 
145 150 155 160 

Arg Arg Gly Pro Pro Ala Arg Val Thr Leu Thr Cys Leu Ala Val Trp 
165 170 175 

Gly Leu Cys Leu Leu Phe Ala Leu Pro Asp Phe He Phe Leu Ser Ala 
180 185 190 

His His Asp Glu Arg Leu Asn Ala Thr His Cys Gin Tyr Asn Phe Pro 
195 200 205 

Gin Val Gly Arg Thr Ala Leu Arg Val Leu Gin Leu Val Ala Gly Phe 
210 215 220 

Leu Leu Pro Leu Leu Val Met Ala Tyr Cys Tyr Ala His He Leu Ala 
225 230 235 240 
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Val Leu Leu Val Ser Arg Gly Gin Arg Arg Leu Arg Ala Met Arg Leu 
245 250 255 

Val Val Val Val Val Val Ala Phe Ala Leu Cys Trp Thr Pro Tyr His 
260 265 270 

5 Leu Val Val Leu Val Asp lie Leu Met Asp Leu Gly Ala Leu Ala Arg 

275 280 285 

Asn Cys Gly Arg Glu Ser Arg Val Asp Val Ala Lys Ser Val Thr Ser 
290 295 300 

Gly Leu Gly Tyr Met His Cys Cys Leu Asn Pro Leu Leu Tyr Ala Phe 
10 305 310 315 320 

Val Gly Val Lys Phe Arg Glu Arg Met Trp Met Leu Leu Leu Arg Leu 
325 330 335 

Gly Cys Pro Asn Gin Arg Gly Leu Gin Arg Gin Pro Ser Ser Ser Arg 
340 345 350 

15 Arg Asp Ser Ser Trp Ser Glu Thr Ser Glu Ala Ser Tyr Ser Gly Leu 

355 360 365 

(22) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 

20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
25 TTAAGCTTGA CCTAATGCCA TCTTGTGTCC 30 
(23) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
TTGGATCCAA AAGAACCATG CACCTCAGAG 30 
35 (24) INFORMATION FOR SEQ ID NO:23: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1074 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



5 



(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
ATGGCTGATG ACTATGGCTC TGAATCCACA TCTTCCATGG AAGACTACGT TAACTTCAAC 
TTCACTGACT TCTACTGTGA GAAAAACAAT GTCAGGCAGT TTGCGAGCCA TTTCCTCCCA 
CCCTTGTACT GGCTCGTGTT CATCGTGGGT GCCTTGGGCA ACAGTCTTGT TATCCTTGTC 

10 TACTGGTACT GCACAAGAGT GAAGACCATG ACCGACATGT TCCTTTTGAA TTTGGCAATT 
GCTGACCTCC TCTTTCTTGT CACTCTTCCC TTCTGGGCCA TTGCTGCTGC TGACCAGTGG 
AAGTTCCAGA CCTTCATGTG CAAGGTGGTC AACAG CATGT ACAAGATGAA CTTCTACAGC 
TGTGTGTTGC TGATCATGTG CATCAGCGTG GACAGGTACA TTGCCATTGC CCAGGCCATG 
AGAGCACATA CTTGGAGGGA GAAAAGGCTT TTGTACAGCA AAATGGTTTG CTTTACCATC 

15 TGGGTATTGG CAGCTGCTCT CTGCATCCCA GAAATCTTAT ACAGCCAAAT CAAGGAGGAA 
TCCGGCATTG CTATCTGCAC CATGGTTTAC CCTAGCGATG AGAGCACCAA ACTGAAGTCA 
GCTGTCTTGA CCCTGAAGGT CATTCTGGGG TTCTTCCTTC CCTTCGTGGT CATGGCTTGC 
TGCTATACCA TCATCATTCA CACCCTGATA CAAGCCAAGA AGTCTTCCAA GCACAAAGCC 
CTAAAAGTGA CCATCACTGT CCTGACCGTC TTTGTCTTGT CTCAGTTTCC CTACAACTGC 

20 ATTTTGTTGG TGCAGACCAT TGACGCCTAT GCCATGTTCA TCTCCAACTG TGCCGTTTCC 
ACCAACATTG ACATCTGCTT CCAGGTCACC CAGACCATCG CCTTCTTCCA CAGTTGCCTG 
AACCCTGTTC TCTATGTTTT TGTGGGTGAG AGATTCCGCC GGGATCTCGT GAAAACCCTG 
AAGAACTTGG GTTGCATCAG CCAGGCCCAG TGGGTTTCAT TTACAAGGAG AGAGGGAAGC 
TTGAAGCTGT CGTCTATGTT GCTGGAGACA ACCTCAGGAG CACTCTCCCT CTGA 

25 (25) INFORMATION FOR SEQ ID NO: 24: 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1074 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 357 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 24: 

Met Ala Asp Asp Tyr Gly Ser Glu Ser Thr Ser Ser Met Glu Asp Tyr 
1 5 10 15 

Val Asn Phe Asn Phe Thr Asp Phe Tyr Cys Glu Lys Asn Asn Val Arg 
20 25 30 

Gin Phe Ala Ser His Phe Leu Pro Pro Leu Tyr Trp Leu Val Phe lie 
35 40 45 

Val Gly Ala Leu Gly Asn Ser Leu Val lie Leu Val Tyr Trp Tyr Cys 
50 55 60 

Thr Arg Val Lys Thr Met Thr Asp Met Phe Leu Leu Asn Leu Ala lie 
65 70 75 80 

Ala Asp Leu Leu Phe Leu Val Thr Leu Pro Phe Trp Ala lie Ala Ala 
85 90 95 

Ala Asp Gin Trp Lys Phe Gin Thr Phe Met Cys Lys Val Val Asn Ser 
100 105 110 

Met Tyr Lys Met Asn Phe Tyr Ser Cys Val Leu Leu lie Met Cys He 
115 120 125 

Ser Val Asp Arg Tyr He Ala He Ala Gin Ala Met Arg Ala His Thr 
130 135 140 

Trp Arg Glu Lys Arg Leu Leu Tyr Ser Lys Met Val Cys Phe Thr He 
145 150 155 160 

Trp Val Leu Ala Ala Ala Leu Cys He Pro Glu He Leu Tyr Ser Gin 
165 170 175 

He Lys Glu Glu Ser Gly He Ala He Cys Thr Met Val Tyr Pro Ser 
180 185 190 

Asp Glu Ser Thr Lys Leu Lys Ser Ala Val Leu Thr Leu Lys Val He 
195 200 205 

Leu Gly Phe Phe Leu Pro Phe Val Val Met Ala Cys Cys Tyr Thr He 
210 215 . 220 

He He His Thr Leu He Gin Ala Lys Lys Ser Ser Lys His Lys Ala 
225 230 235 240 

Leu Lys Val Thr He Thr Val Leu Thr Val Phe Val Leu Ser Gin Phe 
245 250 255 

Pro Tyr Asn Cys He Leu Leu Val Gin Thr He Asp Ala Tyr Ala Met 
260 265 270 



Phe He Ser Asn Cys Ala Val Ser Thr Asn He Asp He Cys Phe Gin 
275 280 285 
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Val Thr Gin Thr lie Ala Phe Phe His Ser Cys Leu Asn Pro Val Leu 
290 295 300 

Tyr Val Phe Val Gly Glu Arg Phe Arg Arg Asp Leu Val Lys Thr Leu 
305 310 315 320 

5 Lys Asn Leu Gly Cys lie Ser Gin Ala Gin Trp Val Ser Phe Thr Arg 

325 330 335 

Arg Glu Gly Ser Leu Lys Leu Ser Ser Met Leu Leu Glu Thr Thr Ser 
340 345 350 

Gly Ala Leu Ser Leu 
10 355 

(26) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1110 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

ATGGCCTCAT CGACCACTCG GGGCCCCAGG GTTTCTGACT TATTTTCTGG GCTGCCGCCG 60 

20 GCGGTCACAA CTCCCGCCAA CCAGAGCGCA GAGGCCTCGG CGGGCAACGG GTCGGTGGCT 120 

GGCGCGGACG CTCCAGCCGT CACGCCCTTC CAGAGCCTGC AGCTGGTGCA TCAGCTGAAG 180 

GGGCTGATCG TGCTGCTCTA CAGCGTCGTG GTGGTCGTGG GGCTGGTGGG CAACTGCCTG 240 

CTGGTGCTGG TGATCGCGCG GGTGCCGCGG CTGCACAACG TGACGAACTT CCTCATCGGC 300 

AACCTGGCCT TGTCCGACGT GCTCATGTGC ACCGCCTGCG TGCCGCTCAC GCTGGCCTAT 360 

25 GCCTTCGAGC CACGCGGCTG GGTGTTCGGC GGCGGCCTGT GCCACCTGGT CTTCTTCCTG 420 

CAGCCGGTCA CCGTCTATGT GTCGGTGTTC ACGCTCACCA CCATCGCAGT GGACCGCTAC 480 

GTCGTGCTGG TGCACCCGCT GAGGCGCGCA TCTCGCTGCG CCTCAGCCTA CGCTGTGCTG 540 

GCCATCTGGG CGCTGTCCGC GGTGCTGGCG CTGCCGCCCG CCGTGCACAC CTATCACGTG 600 

GAGCTCAAGC CGCACGACGT GCGCCTCTGC GAGGAGTTCT GGGGCTCCCA GGAGCGCCAG 660 

30 CGCCAGCTCT ACGCCTGGGG GCTGCTGCTG GTCACCTACC TGCTCCCTCT GCTGGTCATC 720 

CTCCTGTCTT ACGTCCGGGT GTCAGTGAAG CTCCGCAACC GCGTGGTGCC GGGCTGCGTG 780 

ACCCAGAGCC AGGCCGACTG GGACCGCGCT CGGCGCCGGC GCACCTTCTG CTTGCTGGTG 840 
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GTGGTCGTGG TGGTGTTCGC CGTCTGCTGG CTGCCGCTGC ACGTCTTCAA CCTGCTGCGG 900 

GACCTCGACC CCCACGCCAT CGACCCTTAC GCCTTTGGGC TGGTGCAGCT GCTCTGCCAC 960 
TGGCTCGCCA TGAGTTCGGC CTGCTACAAC CCCTTCATCT ACGCCTGGCT GCACGACAGC 1020 

TTCCGCGAGG AGCTGCGCAA ACTGTTGGTC GCTTGGCCCC GCAAGATAGC CCCCCATGGC 1080 

CAGAATATGA CCGTCAGCGT GGTCATCTGA 1110 
(27) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 369 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Ala Ser Ser Thr Thr Arg Gly Pro Arg Val Ser Asp Leu Phe Ser 
15 10 15 

Gly Leu Pro Pro Ala Val Thr Thr Pro Ala Asn Gin Ser Ala Glu Ala 
20 25 30 

Ser Ala Gly Asn Gly Ser Val Ala Gly Ala Asp Ala Pro Ala Val Thr 
35 40 45 

Pro Phe Gin Ser Leu Gin Leu Val His Gin Leu Lys Gly Leu lie Val 
50 55 60 

Leu Leu Tyr Ser Val Val Val Val Val Gly Leu Val Gly Asn Cys Leu 
65 70 75 80 

Leu Val Leu Val lie Ala Arg Val Pro Arg Leu His Asn Val Thr Asn 
85 90 95 

Phe Leu lie Gly Asn Leu Ala Leu Ser Asp Val Leu Met Cys Thr Ala 
100 105 110 

Cys Val Pro Leu Thr Leu Ala Tyr Ala Phe Glu Pro Arg Gly Trp Val 
115 120 125 

Phe Gly Gly Gly Leu Cys His Leu Val Phe Phe Leu Gin Pro Val Thr 
130 135 140 

Val Tyr Val Ser Val Phe Thr Leu Thr Thr lie Ala Val Asp Arg Tyr 
145 150 155 160 

Val Val Leu Val His Pro Leu Arg Arg Ala Ser Arg Cys Ala Ser Ala 
165 170 175 
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Tyr Ala Val Leu Ala He Trp Ala Leu Ser Ala Val Leu Ala Leu Pro 
180 185 190 

Pro Ala Val His Thr Tyr His Val Glu Leu Lys Pro His Asp Val Arg 
195 200 205 

5 Leu Cys Glu Glu Phe Trp Gly Ser Gin Glu Arg Gin Arg Gin Leu Tyr 

210 215 220 

Ala Trp Gly Leu Leu Leu Val Thr Tyr Leu Leu Pro Leu Leu Val He 
225 230 235 240 

Leu Leu Ser Tyr Val Arg Val Ser Val Lys Leu Arg Asn Arg Val Val 
10 245 250 255 

Pro Gly Cys Val Thr Gin Ser Gin Ala Asp Trp Asp Arg Ala Arg Arg 
260 265 270 

Arg Arg Thr Phe Cys Leu Leu Val Val Val Val Val Val Phe Ala Val 
275 280 285 

15 Cys Trp Leu Pro Leu His Val Phe Asn Leu Leu Arg Asp Leu Asp Pro 

290 295 300 

His Ala He Asp Pro Tyr Ala Phe Gly Leu Val Gin Leu Leu Cys His 
305 310 315 320 

Trp Leu Ala Met Ser Ser Ala Cys Tyr Asn Pro Phe He Tyr Ala Trp 
20 325 330 335 

Leu His Asp Ser Phe Arg Glu Glu Leu Arg Lys Leu Leu Val Ala Trp 
340 345 350 

Pro Arg Lys He Ala Pro His Gly Gin Asn Met Thr Val Ser Val Val 
355 360 , 365 

25 He 



(28) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1083 base pairs 

30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
35 ATGGACCCAG AAGAAACTTC AGTTTATTTG GATTATTACT ATGCTACGAG CCCAAACTCT 60 



GACATCAGGG AGACCCACTC CCATGTTCCT TACACCTCTG TCTTCCTTCC AGTCTTTTAC 120 



WO 00/22129 PCT/US99/23938 

23 

ACAGCTGTGT TCCTGACTGG AGTGCTGGGG AACCTTGTTC TCATGGGAGC GTTGCATTTC 180 

AAACCCGGCA GCCGAAGACT GATCGACATC TTTATCATCA ATCTGGCTGC CTCTGACTTC 240 

ATTTTTCTTG TCACATTGCC TCTCTGGGTG GATAAAGAAG CATCTCTAGG ACTGTGGAGG 300 

ACGGGCTCCT TCCTGTGCAA AGGGAGCTCC TACATGATCT CCGTCAATAT GCACTGCAGT 360 

5 GTCCTCCTGC TCACTTGCAT GAGTGTTGAC CGCTACCTGG CCATTGTGTG GCCAGTCGTA 420 

TCCAGGAAAT TCAGAAGGAC AGACTGTGCA TATGTAGTCT GTGCCAGCAT CTGGTTTATC 480 

TCCTGCCTGC TGGGGTTGCC TACTCTTCTG TCCAGGGAGC TCACGCTGAT TGATGATAAG 540 

CCATACTGTG CAGAGAAAAA GGCAACTCCA ATTAAACTCA TATGGTCCCT GGTGGCCTTA 600 

ATTTTCACCT TTTTTGTCCC TTTGTTGAGC ATTGTGACCT GCTACTGTTG CATTGCAAGG 660 

10 AAGCTGTGTG CCCATTACCA GCAATCAGGA AAGCACAACA AAAAGCTGAA GAAATCTATA 720 

AAGATCATCT TTATTGTCGT GGCAGCCTTT CTTGTCTCCT GGCTGCCCTT CAATACTTTC 780 

AAGTTCCTGG CCATTGTCTC TGGGTTGCGG CAAGAACACT ATTTACCCTC AGCTATTCTT 840 

CAGCTTGGTA TGGAGGTGAG TGGACCCTTG GCATTTGCCA ACAGCTGTGT CAACCCTTTC 900 

ATTTACTATA TCTTCGACAG CTACATCCGC CGGGCCATTG TCCACTGCTT GTGCCCTTGC ' 960 

15 CTGAAAAACT ATGACTTTGG GAGTAGCACT GAGACATCAG ATAGTCACCT CACTAAGGCT 1020 

CTCTCCACCT TCATTCATGC AGAAGATTTT GCCAGGAGGA GGAAGAGGTC TGTGTCACTC 1080 

TAA 1083 
(29) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 360 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Asp Pro Glu Glu Thr Ser Val Tyr Leu Asp Tyr Tyr Tyr Ala Thr 
15 10 15 

Ser Pro Asn Ser Asp lie Arg Glu Thr His Ser His Val Pro Tyr Thr 
20 25 30 

30 Ser Val Phe Leu Pro Val Phe Tyr Thr Ala Val Phe Leu Thr Gly Val 

35 40 45 
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Leu Gly Asn Leu Val Leu Met Gly Ala Leu His Phe Lys Pro Gly Ser 
50 55 60 

Arg Arg Leu lie Asp He Phe He He Asn Leu Ala Ala Ser Asp Phe 
65 70 75 80 

He Phe Leu Val Thr Leu Pro Leu Trp Val Asp Lys Glu Ala Ser Leu 
85 90 95 

Gly Leu Trp Arg Thr Gly Ser Phe Leu Cys Lys Gly Ser Ser Tyr Met 
100 105 110 

He Ser Val Asn Met His Cys Ser Val Leu Leu Leu Thr Cys Met Ser 
115 120 125 

Val Asp Arg Tyr Leu Ala He Val Trp Pro Val Val Ser Arg Lys Phe 
130 135 140 

Arg Arg Thr Asp Cys Ala Tyr Val Val Cys Ala Ser He Trp Phe He 
145 150 155 160 

Ser Cys Leu Leu Gly Leu Pro Thr Leu Leu Ser Arg Glu Leu Thr Leu 
165 170 175 

He Asp Asp Lys Pro Tyr Cys Ala Glu Lys Lys Ala Thr Pro He Lys 
180 185 190 

Leu He Trp Ser Leu Val Ala Leu He Phe Thr Phe Phe Val Pro Leu 
195 200 205 

Leu Ser He Val Thr Cys Tyr Cys Cys He Ala Arg Lys Leu Cys Ala 
210 215 220 

His Tyr Gin Gin Ser Gly Lys His Asn Lys Lys Leu Lys Lys Ser He 
225 230 235 240 

Lys He He Phe He Val Val Ala Ala Phe Leu Val Ser Trp Leu Pro 
245 250 255 

Phe Asn Thr Phe Lys Phe Leu Ala He Val Ser Gly Leu Arg Gin Glu 
260 265 270 

His Tyr Leu Pro Ser Ala He Leu Gin Leu Gly Met Glu Val Ser Gly 
275 280 285 

Pro Leu Ala Phe Ala Asn Ser Cys Val Asn Pro Phe He Tyr Tyr He 
290 295 300 

Phe Asp Ser Tyr He Arg Arg Ala He Val His Cys Leu Cys Pro Cys 
305 310 315 320 

Leu Lys Asn Tyr Asp Phe Gly Ser Ser Thr Glu Thr Ser Asp Ser His 
325 330 335 



Leu Thr Lys Ala Leu Ser Thr Phe lie His Ala Glu Asp Phe Ala Arg 
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340 345 350 

Arg Arg Lys Arg Ser Val Ser Leu 
355 360 

(3 0) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
CTAGAATTCT GACTCCAGCC AAAGCATGAA T 31 

(31) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GCTGGATCCT AAACAGTCTG CGCTCGGCCT 3 0 

(32) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

ATGAATGGCC TTGAAGTGGC TCCCCCAGGT CTGATCACCA ACTTCTCCCT GGCCACGGCA 60 

GAGCAATGTG GCCAGGAGAC GCCACTGGAG AACATGCTGT TCGCCTCCTT CTACCTTCTG 120 

GATTTTATCC TGGCTTTAGT TGGCAATACC CTGGCTCTGT GGCTTTTCAT CCGAGACCAC 180 

AAGTCCGGGA CCCCGGCCAA CGTGTTCCTG ATGCATCTGG CCGTGGCCGA CTTGTCGTGC 240 

GTGCTGGTCC TGCCCACCCG CCTGGTCTAC CACTTCTCTG GGAACCACTG GCCATTTGGG 3 00 
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GAAATCGCAT GCCGTCTCAC CGGCTTCCTC TTCTACCTCA ACATGTACGC CAGCATCTAC 360 

TTCCTCACCT GCATCAGCGC CGACCGTTTC CTGGCCATTG TGCACCCGGT CAAGTCCCTC 420 

AAGCTCCGCA GGCCCCTCTA CG CACACCTG GCCTGTGCCT TCCTGTGGGT GGTGGTGGCT 480 

GTGGCCATGG CCCCGCTGCT GGTGAGCCCA CAGACCGTGC AGACCAACCA CACGGTGGTC 540 

TGCCTGCAGC TGTACCGGGA GAAGGCCTCC CACCATGCCC TGGTGTCCCT GGCAGTGGCC 600 

TTCACCTTCC CGTTCATCAC CACGGTCACC TGCTACCTGC TGATCATCCG CAGCCTGCGG 660 

CAGGGCCTGC GTGTGGAGAA GCGCCTCAAG ACCAAGGCAG TGCGCATGAT CGCCATAGTG 720 

CTGGCCATCT TCCTGGTCTG CTTCGTGCCC TACCACGTCA ACCGCTCCGT CTACGTGCTG 780 

CACTACCGCA GCCATGGGGC CTCCTGCGCC ACCCAGCGCA TCCTGGCCCT GGCAAACCGC 840 

ATCACCTCCT GCCTCACCAG CCTCAACGGG GCACTCGACC CCATCATGTA TTTCTTCGTG .900 

GCTGAGAAGT TCCGCCACGC CCTGTGCAAC TTGCTCTGTG GCAAAAGGCT CAAGGGCCCG 960 

CCCCCCAGCT TCGAAGGGAA AACCAACGAG AGCTCGCTGA GTGCCAAGTC AGAGCTGTGA 1020 
(33) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 339 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Asn Gly Leu Glu Val Ala Pro Pro Gly Leu lie Thr Asn Phe Ser 
15 10 15 

Leu Ala Thr Ala Glu Gin Cys Gly Gin Glu Thr Pro Leu Glu Asn Met 
20 25 30 

Leu Phe Ala Ser Phe Tyr Leu Leu Asp Phe He Leu Ala Leu Val Gly 
35 40 45 

Asn Thr Leu Ala Leu Trp Leu Phe He Arg Asp His Lys Ser Gly Thr 
50 55 60 

Pro Ala Asn Val Phe Leu Met His Leu Ala Val Ala Asp Leu Ser Cys 
65 70 75 80 

Val Leu Val Leu Pro Thr Arg Leu Val Tyr His Phe Ser Gly Asn His 
85 90 95 

Trp Pro Phe Gly Glu He Ala Cys Arg Leu Thr Gly Phe Leu Phe Tyr 
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100 105 110 

Leu Asn Met Tyr Ala Ser lie Tyr Phe Leu Thr Cys lie Ser Ala Asp 
115 120 125 

Arg Phe Leu Ala lie Val His Pro Val Lys Ser Leu Lys Leu Arg Arg 
130 135 140 

Pro Leu Tyr Ala His Leu Ala Cys Ala Phe Leu Trp Val Val Val Ala 
145 150 155 160 

Val Ala Met Ala Pro Leu Leu Val Ser Pro Gin Thr Val Gin Thr Asn 
165 170 175 

His Thr Val Val Cys Leu Gin Leu Tyr Arg Glu Lys Ala Ser His His 
180 185 190 

Ala Leu Val Ser Leu Ala Val Ala Phe Thr Phe Pro Phe lie Thr Thr 
195 200 205 

Val Thr Cys Tyr Leu Leu lie lie Arg Ser Leu Arg Gin Gly Leu Arg 
210 215 220 

Val Glu Lys Arg Leu Lys Thr Lys Ala Val Arg Met lie Ala lie Val 
225 230 235 240 

Leu Ala lie Phe Leu Val Cys Phe Val Pro Tyr His Val Asn Arg Ser 
245 250 255 

Val Tyr Val Leu His Tyr Arg Ser His Gly Ala Ser Cys Ala Thr Gin 
260 265 , 270 

Arg He Leu Ala Leu Ala Asn Arg He Thr Ser Cys Leu Thr Ser Leu 
275 280 285 

Asn Gly Ala Leu Asp Pro He Met Tyr Phe Phe Val Ala Glu Lys Phe 
290 295 300 

Arg His Ala Leu Cys Asn Leu Leu Cys Gly Lys Arg Leu Lys Gly Pro 
305 310 315 320 

Pro Pro Ser Phe Glu Gly Lys Thr Asn Glu Ser Ser Leu Ser Ala Lys 
325 330 335 

Ser Glu Leu 
(34) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
ATAAGATGAT CACCCTGAAC AATCAAGAT 29 

(35) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
TCCGAATTCA TAACATTTCA CTGTTTATAT TGC 33 

(36) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 996 base pairs 

(B) TYPE: nucleic acid ' 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

ATGATCACCC TGAACAATCA AGATCAACCT GTCACTTTTA ACAGCTCACA TCCAGATGAA 60 

TACAAAATTG CAGCCCTTGT CTTCTATAGC TGTATCTTCA TAATTGGATT ATTTGTTAAC 120 

ATCACTGCAT TATGGGTTTT CAGTTGTACC ACCAAGAAGA GAACCACGGT AACCATCTAT 180 

ATGATGAATG TGGCATTAGT GGACTTGATA TTTATAATGA CTTTACCCTT TCGAATGTTT 240 

TATTATGCAA AAGATGCATG GCCATTTGGA GAGTACTTCT GCCAGATTAT TGGAGCTCTC 300 

ACAGTGTTTT ACCCAAGCAT TGCTTTATGG CTTCTTGCCT TTATTAGTGC TGACAGATAC 360 

ATGGCCATTG TACAGCCGAA GTACGCCAAA GAACTTAAAA ACACGTGCAA AGCCGTGCTG 420 

GCGTGTGTGG GAGTCTGGAT AATGACCCTG ACCACGACCA CCCCTCTGCT ACTGCTCTAT 480 

AAAGACCCAG ATAAAGACTC CACTCCCGCC ACCTGCCTCA AGATTTCTGA CATCATCTAT 540 

CTAAAAGCTG TGAACGTGCT GAACCTCACT CGACTGACAT TTTTTTTCTT GATTCCTTTG 600 

TTCATCATGA TTGGGTGCTA CTTGGTCATT ATTCATAATC TCCTTCACGG CAGGACGTCT 660 

AAGCTGAAAC CCAAAGTCAA GGAGAAGTCC ATAAGGATCA TCATCACGCT GCTGGTGCAG 720 
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GTGCTCGTCT GCTTTATGCC CTTCCACATC TGTTTCGCTT TCCTGATGCT GGGAACGGGG 780 

GAGAACAGTT ACAATCCCTG GGGAGCCTTT ACCACCTTCC TCATGAACCT CAGCACGTGT 840 

CTGGATGTGA TTCTCTACTA CATCGTTTCA AAACAATTTC AGGCTCGAGT CATTAGTGTC 900 

ATGCTATACC GTAATTACCT TCGAAGCCTG CGCAGAAAAA GTTTCCGATC TGGTAGTCTA 960 



(37) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

Met lie Thr Leu Asn Asn Gin Asp Gin Pro Val Thr Phe Asn Ser Ser 
15 10 15 

His Pro Asp Glu Tyr Lys lie Ala Ala Leu Val Phe Tyr Ser Cys lie 
20 25 30 

Phe lie lie Gly Leu Phe Val Asn lie Thr Ala Leu Trp Val Phe Ser 
35 40 45 

Cys Thr Thr Lys Lys Arg Thr Thr Val Thr lie Tyr Met Met Asn Val 
50 55 60 

Ala Leu Val Asp Leu He Phe He Met Thr Leu Pro Phe Arg Met Phe 
65 70 * IS 80 

Tyr Tyr Ala Lys Asp Ala Trp Pro Phe Gly Glu Tyr Phe Cys Gin He 
85 90 95 

He Gly Ala Leu Thr Val Phe Tyr Pro Ser He Ala Leu Trp Leu Leu 
100 105 110 

Ala Phe He Ser Ala Asp Arg Tyr Met Ala He Val Gin Pro Lys Tyr 
115 120 125 

Ala Lys Glu Leu Lys Asn Thr Cys Lys Ala Val Leu Ala Cys Val Gly 
130 135 140 

Val Trp He Met Thr Leu Thr Thr Thr Thr Pro Leu Leu Leu Leu Tyr 
145 150 155 160 

Lys Asp Pro Asp Lys Asp Ser Thr Pro Ala Thr Cys Leu Lys He Ser 



AGGTCACTAA GCAATATAAA CAGTGAAATG TTATGA 



996 



165 



170 



175 
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Asp lie lie Tyr Leu Lys Ala Val Asn Val Leu Asn Leu Thr Arg Leu 
180 185 190 

Thr Phe Phe Phe Leu He Pro Leu Phe He Met He Gly Cys Tyr Leu 
195 200 205 

5 Val He He His Asn Leu Leu His Gly Arg Thr Ser Lys Leu Lys Pro 

210 215 220 

Lys Val Lys Glu Lys Ser He Arg He He He Thr Leu Leu Val Gin 
225 230 235 240 

Val Leu Val Cys Phe Met Pro Phe His He Cys Phe Ala Phe Leu Met 
10 245 250 255 

Leu Gly Thr Gly Glu Asn Ser Tyr Asn Pro Trp Gly Ala Phe Thr Thr 
260 265 270 

Phe Leu Met Asn Leu Ser Thr Cys Leu Asp Val He Leu Tyr Tyr He 
275 280 285 

15 Val Ser Lys Gin Phe Gin Ala Arg Val He Ser Val Met Leu Tyr Arg 

290 295 300 

Asn Tyr Leu Arg Ser Leu Arg Arg Lys Ser Phe Arg Ser Gly Ser Leu 
305 310 315 * 320 

Arg Ser Leu Ser Asn He Asn Ser Glu Met Leu 
20 325 330 

(38) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
CCAAGCTTCC AGGCCTGGGG TGTGCTGG 28 
30 (39) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
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ATGGATCCTG ACCTTCGGCC CCTGGCAGA 29 

(40) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1077 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

ATGCCCTCTG TGTCTCCAGC GGGGCCCTCG GCCGGGGCAG TCCCCAATGC CACCGCAGTG 60 

ACAACAGTGC GGACCAATGC CAGCGGGCTG GAGGTGCCCC TGTTCCACCT GTTTGCCCGG 120 

CTGGACGAGG AGCTGCATGG CACCTTCCCA GGCCTGTGCG TGGCGCTGAT GGCGGTGCAC 180 

GGAGCCATCT TCCTGGCAGG GCTGGTGCTC AACGGGCTGG CGCTGTACGT CTTCTGCTGC 240 

CGCACCCGGG CCAAGACACC CTCAGTCATC TACACCATCA ACCTGGTGGT GACCGATCTA 300 

CTGGTAGGGC TGTCCCTGCC CACGCGCTTC GCTGTGTACT ACGGCGCCAG GGGCTGCCTG 360 

CGCTGTGCCT TCCCGCACGT CCTCGGTTAC TTCCTCAACA TGCACTGCTC CATCCTCTTC 420 

CTCACCTGCA TCTGCGTGGA CCGCTACCTG GCCATCGTGC GGCCCGAAGG CTCCCGCCGC 480 

TGCCGCCAGC CTGCCTGTGC CAGGGCCGTG TGCGCCTTCG TGTGGCTGGC CGCCGGTGCC 540 

GTCACCCTGT CGGTGCTGGG CGTGACAGGC AGCCGGCCCT GCTGCCGTGT CTTTGCGCTG 600 

ACTGTCCTGG AGTTCCTGCT GCCCCTGCTG GTCATCAGCG TGTTTACCGG CCGCATCATG 660 

TGTGCACTGT CGCGGCCGGG TCTGCTCCAC CAGGGTCGCC AGCGCCGCGT GCGGGCCATG 720 

CAGCTCCTGC TCACGGTGCT CATCATCTTT CTCGTCTGCT TCACGCCCTT CCACGCCCGC 780 

CAAGTGGCCG TGGCGCTGTG GCCCGACATG CCACACCACA CGAGCCTCGT GGTCTACCAC 84 0 

GTGGCCGTGA CCCTCAGCAG CCTCAACAGC TGCATGGACC CCATCGTCTA CTGCTTCGTC 900 

ACCAGTGGCT TCCAGGCCAC CGTCCGAGGC CTCTTCGGCC AGCACGGAGA GCGTGAGCCC 960 

AGCAGCGGTG ACGTGGTCAG CATGCACAGG AGCTCCAAGG GCTCAGGCCG TCATCACATC 1020 

CTCAGTGCCG GCCCTCACGC CCTCACCCAG GCCCTGGCTA ATGGGCCCGA GGCTTAG 1077 

(41) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 358 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
Met Pro Ser Val Ser Pro Ala Gly Pro Ser Ala Gly Ala Val Pro Asn 

Ala Thr Ala Val Thr Thr Val Arg Thr Asn Ala Ser Gly Leu Glu Val 
20 25 30 

Pro Leu Phe His Leu Phe Ala Arg Leu Asp Glu Glu Leu His Gly Thr 
35 40 45 

Phe Pro Gly Leu Cys Val Ala Leu Met Ala Val His Gly Ala He Phe 
50 55 60 

Leu Ala Gly Leu Val Leu Asn Gly Leu Ala Leu Tyr Val Phe Cys Cys 
15 65 70 75 80 

Arg Thr Arg Ala Lys Thr Pro Ser Val He Tyr Thr He Asn Leu Val 
85 so 95 

Val Thr Asp Leu Leu Val Gly Leu Ser Leu Pro Thr Arg Phe Ala Val 
100 105 no 

20 Tyr Tyr Gly Ala Arg Gly Cys Leu Arg Cys Ala Phe Pro His Val Leu 

115 120 125 

Gly Tyr Phe Leu Asn Met His Cys Ser He Leu Phe Leu Thr Cys He 
130 135 140 

Cys Val Asp Arg Tyr Leu Ala lie Val Arg Pro Glu Ala Pro Ala Ala 
25 "5 150 155 160 

Cys Arg Gin Pro Ala Cys Ala Arg Ala Val Cys Ala Phe Val Trp Leu 
165 170 175 

Ala Ala Gly Ala Val Thr Leu Ser Val Leu Gly Val Thr Gly Ser Arg 
180 185 190 

Pro Cys Cys Arg Val Phe Ala Leu Thr Val Leu Glu Phe Leu Leu Pro 
195 200 205 

Leu Leu Val He Ser Val Phe Thr Gly Arg lie Met Cys Ala Leu Ser 
210 215 220 

Arg Pro Gly Leu Leu His Gin Gly Arg Gin Arg Arg Val Arg Ala Met 
35 225 230 235 ~ 2 40 

Gin Leu Leu Leu Thr Val Leu He He Phe Leu Val Cys Phe Thr Pro 
245 250 255 



30 
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Phe His Ala Arg Gin Val Ala Val Ala Leu Trp Pro Asp Met Pro His 
260 265 270 

His Thr Ser Leu Val Val Tyr His Val Ala Val Thr Leu Ser Ser Leu 
275 280 285 

Asn Ser Cys Met Asp Pro He Val Tyr Cys Phe Val Thr Ser Gly Phe 
290 295 300 

Gin Ala Thr Val Arg Gly Leu Phe Gly Gin His Gly Glu Arg Glu Pro 
305 310 315 320 

Ser Ser Gly Asp Val Val Ser Met His Arg Ser Ser Lys Gly Ser Gly 
325 330 335 

Arg His His He Leu Ser Ala Gly Pro His Ala Leu Thr Gin Ala Leu 
340 345 350 

Ala Asn Gly Pro Glu Ala 
355 

(42) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
GAGAATTCAC TCCTGAGCTC AAGATGAACT 30 

(43) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
CGGGATCCCC GTAACTGAGC CACTTCAGAT 3 0 

(44) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1050 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

ATGAACTCCA CCTTGGATGG TAATCAGAGC AGCCACCCTT TTTGCCTCTT GGCATTTGGC 60 

TATTTGGAAA CTGTCAATTT TTGCCTTTTG GAAGTATTGA TTATTGTCTT TCTAACTGTA 120 

TTGATTATTT CTGGCAACAT CATTGTGATT TTTGTATTTC ACTGTGCACC TTTGTTGAAC 180 

CATCACACTA CAAGTTATTT TATCCAGACT ATGGCATATG CTGACCTTTT TGTTGGGGTG 240 

AGCTGCGTGG TCCCTTCTTT ATCACTCCTC CATCACCCCC TTCCAGTAGA GGAGTCCTTG 300 

ACTTGCCAGA TATTTGGTTT TGTAGTATCA GTTCTGAAGA GCGTCTCCAT GGCTTCTCTG 360 

GCCTGTATCA GCATTGATAG ATACATTGCC ATTACTAAAC CTTTAACCTA TAATACTCTG 420 

GTTACACCCT GGAGACTACG CCTGTGTATT TTCCTGATTT GGCTATACTC GACCCTGGTC 480 

TTCCTGCCTT CCTTTTTCCA CTGGGGCAAA CCTGGATATC ATGGAGATGT GTTTCAGTGG 540 

TGTGCGGAGT CCTGGCACAC CGACTCCTAC TTCACCCTGT TCATCGTGAT GATGTTATAT 600 

GCCCCAGCAG CCCTTATTGT CTGCTTCACC TATTTCAACA TCTTCCGCAT CTGCCAACAG 660 

CACACAAAGG ATATCAGCGA AAGGCAAGCC CGCTTCAGCA GCCAGAGTGG GGAGACTGGG 720 

GAAGTGCAGG CCTGTCCTGA TAAGCGCTAT GCCATGGTCC TGTTTCGAAT CACTAGTGTA 780 

TTTTACATCC TCTGGTTGCC ATATATCATC TACTTCTTGT TGGAAAGCTC CACTGGCCAC 840 

AGCAACCGCT TCGCATCCTT CTTGACCACC TGGCTTGCTA TTAGTAACAG TTTCTGCAAC 900 

TGTGTAATTT ATAGTCTCTC CAACAGTGTA TTCCAAAGAG GACTAAAGCG CCTCTCAGGG 960 

GCTATGTGTA CTTCTTGTGC AAGTCAGACT ACAGCCAACG ACCCTTACAC AGTTAGAAGC 1020 

AAAGGCCCTC TTAATGGATG TCATATCTGA 1050 
(45) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 
Met Asn Ser Thr Leu Asp Gly Asn Gin Ser Ser His Pro Phe Cys Leu 
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15 10 15 

Leu Ala Phe Gly Tyr Leu Glu Thr Val Asn Phe Cys Leu Leu Glu Val 
20 25 30 

Leu He He Val Phe Leu Thr Val Leu He He Ser Gly Asn He He 
35 40 45 

Val He Phe Val Phe His Cys Ala Pro Leu Leu Asn His His Thr Thr 
50 55 60 

Ser Tyr Phe He Gin Thr Met Ala Tyr Ala Asp Leu Phe Val Gly Val 
65 70 75 80 

Ser Cys Val Val Pro Ser Leu Ser Leu Leu His His Pro Leu Pro Val 
85 90 95 

Glu Glu Ser Leu Thr Cys Gin lie Phe Gly Phe Val Val Ser Val Leu 
100 105 no 

Lys Ser Val Ser Met Ala Ser Leu Ala Cys He Ser He Asp Arg Tyr 
115 120 125 

He Ala He Thr Lys Pro Leu Thr Tyr Asn Thr Leu Val Thr Pro Trp 
130 135 140 

Arg Leu Arg Leu Cys He Phe Leu He Trp Leu Tyr Ser Thr Leu Val 
145 150 155 160 

Phe Leu Pro Ser Phe Phe His Trp Gly Lys Pro Gly Tyr His Gly Asp 
165 170 175 

Val Phe Gin Trp Cys Ala Glu Ser Trp His Thr Asp Ser Tyr Phe Thr 
180 185 190 

Leu Phe He Val Met Met Leu Tyr Ala Pro Ala Ala Leu He Val Cys 
195 200 205 

Phe Thr Tyr Phe Asn He Phe Arg He Cys Gin Gin His Thr Lys Asp 
210 215 220 

He Ser Glu Arg Gin Ala Arg Phe Ser Ser Gin Ser Gly Glu Thr Gly 
225 230 235 240 

Glu Val Gin Ala Cys Pro Asp Lys Arg Tyr Ala Met Val Leu Phe Arg 
245 250 255 

He Thr Ser Val Phe Tyr He Leu Trp Leu Pro Tyr He He Tyr Phe 
260 265 270 

Leu Leu Glu Ser Ser Thr Gly His Ser Asn Arg Phe Ala Ser Phe Leu 
275 280 285 



Thr Thr Trp Leu Ala He Ser Asn Ser Phe Cys Asn Cys Val He Tyr 
290 295 300 
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Ser Leu Ser Asn Ser 
305 



Val Phe Gin 
310 



Arg Gly Leu Lys Arg Leu Ser Gly 
315 ~ 320 



Ala Met Cys Thr Ser 
325 



Cys Ala Ser 



Gin Thr Thr Ala Asn Asp Pro Tyr 
330 335 



Thr Val Arg Ser Lys 
340 



Gly Pro Leu 



Asn Gly Cys His He 
345 



(46) INFORMATION FOR SEQ ID NO:45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
TCCCCCGGGA AAAAAACCAA CTGCTCCAAA 30 

(47) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 
TAGGATCCAT TTGAATGTGG ATTTGGTGAA A 3l 

(48) INFORMATION FOR SEQ ID NO:47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1302 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

ATGTGTTTTT CTCCCATTCT GGAAATCAAC ATGCAGTCTG AATCTAACAT TACAGTGCGA 60 

GATGACATTG ATGACATCAA CACCAATATG TACCAACCAC TATCATATCC GTTAAGCTTT 120 

CAAGTGTCTC TCACCGGATT TCTTATGTTA GAAATTGTGT TGGGACTTGG CAGCAACCTC 180 
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ACTGTATTGG TACTTTACTG CATGAAATCC AACTTAATCA ACTCTGTCAG TAACATTATT 240 

ACAATGAATC TTCATGTACT TGATGTAATA ATTTGTGTGG GATGTATTCC TCTAACTATA 300 

GTTATCCTTC TGCTTTCACT GGAGAGTAAC ACTGCTCTCA TTTGCTGTTT CCATGAGGCT 360 

TGTGTATCTT TTGCAAGTGT CTCAACAGCA ATCAACGTTT TTGCTATCAC TTTGGACAGA 420 

5 TATGACATCT CTGTAAAACC TGCAAACCGA ATTCTGACAA TGGGCAGAGC TGTAATGTTA 480 

ATGATATCCA TTTGGATTTT TTCTTTTTTC TCTTTCCTGA TTCCTTTTAT TGAGGTAAAT 540 

TTTTTCAGTC TTCAAAGTGG AAATACCTGG GAAAACAAGA CACTTTTATG TGTCAGTACA 600 

AATGAATACT ACACTGAACT GGGAATGTAT TATCACCTGT TAGTACAGAT CCCAATATTC 660 

TTTTTCACTG TTGTAGTAAT GTTAATCACA TACACCAAAA TACTTCAGGC TCTTAATATT 720 

10 CGAATAGGCA CAAGATTTTC AACAGGGCAG AAGAAGAAAG CAAGAAAGAA AAAGACAATT 780 

TCTCTAACCA CACAACATGA GGCTACAGAC ATGTCACAAA GCAGTGGTGG GAGAAATGTA 840 

GTCTTTGGTG TAAGAACTTC AGTTTCTGTA ATAATTGCCC TCCGGCGAGC TGTGAAACGA 900 

CACCGTGAAC GACGAGAAAG ACAAAAGAGA GTCTTCAGGA TGTCTTTATT GATTATTTCT 960 

ACATTTCTTC TCTGCTGGAC ACCAATTTCT GTTTTAAATA CCACCATTTT ATGTTTAGGC 1020 

15 CCAAGTGACC TTTTAGTAAA ATTAAGATTG TGTTTTTTAG TCATGGCTTA TGGAACAACT 1080 

ATATTTCACC CTCTATTATA TGCATTCACT AGACAAAAAT TTCAAAAGGT CTTGAAAAGT 1140 

AAAATGAAAA AGCGAGTTGT TTCTATAGTA GAAGCTGATC CCCTGCCTAA TAATGCTGTA 1200 

ATACACAACT CTTGGATAGA TCCCAAAAGA AACAAAAAAA TTACCTTTGA AGATAGTGAA 1260 

ATAAGAGAAA AACGTTTAGT GCCTCAGGTT GTCACAGACT AG 1302 
20 (49) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : 

25 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

Met Cys Phe Ser Pro He Leu Glu He Asn Met Gin Ser Glu Ser Asn 
1 5 10 .15 

30 He Thr Val Arg Asp Asp He Asp Asp He Asn Thr Asn Met Tyr Gin 

20 25 30 
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Pro Leu Ser Tyr Pro Leu Ser Phe Gin Val Ser Leu Thr Gly Phe Leu 
35 40 45 

Met Leu Glu lie Val Leu Gly Leu Gly Ser Asn Leu Thr Val Leu Val 
50 55 60 

5 Leu Tyr Cys Met Lys Ser Asn Leu lie Asn Ser Val Ser Asn lie He 

65 70 75 80 

Thr Met Asn Leu His Val Leu Asp Val He He Cys Val Gly Cys He 
85 90 95 

Pro Leu Thr He Val He Leu Leu Leu Ser Leu Glu Ser Asn Thr Ala 
10 100 105 110 

Leu He Cys Cys Phe His Glu Ala Cys Val Ser Phe Ala Ser Val Ser 
115 120 125 

Thr Ala He Asn Val Phe Ala He Thr Leu Asp Arg Tyr Asp He Ser 
130 135 140 

15 Val Lys Pro Ala Asn Arg He Leu Thr Met Gly Arg Ala Val Met Leu 

145 150 155 160 

Met He Ser He Trp He Phe Ser Phe Phe Ser Phe Leu He Pro Phe 
165 170 175 

He Glu Val Asn Phe Phe Ser Leu Gin Ser Gly Asn Thr Trp Glu Asn 
20 180 185 190 

Lys Thr Leu Leu Cys Val Ser Thr Asn Glu Tyr Tyr Thr Glu Leu Gly 
195 200 205 

Met Tyr Tyr His Leu Leu Val Gin He Pro He Phe Phe Phe Thr Val 
210 215 220 

25 Val Val Met Leu He Thr Tyr Thr Lys He Leu Gin Ala Leu Asn He 

225 230 235 240 

Arg He Gly Thr Arg Phe Ser Thr Gly Gin Lys Lys Lys Ala Arg Lys 
245 250 255 

Lys Lys Thr He Ser Leu Thr Thr Gin His Glu Ala Thr Asp Met Ser 
30 260 265 270 

Gin Ser Ser Gly Gly Arg Asn Val Val Phe Gly Val Arg Thr Ser Val 
275 280 285 

Ser Val He He Ala Leu Arg Arg Ala Val Lys Arg His Arg Glu Arg 
290 295 300 

35 Arg Glu Arg Gin Lys Arg Val Phe Arg Met Ser Leu Leu He He Ser 

305 310 315 320 



Thr Phe Leu Leu Cys Trp Thr Pro He Ser Val Leu Asn Thr Thr He 
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325 330 335 

Leu Cys Leu Gly Pro Ser Asp Leu Leu Val Lys Leu Arg Leu Cys Phe 
340 345 350 

Leu Val Met Ala Tyr Gly Thr Thr lie Phe His Pro Leu Leu Tyr Ala 
355 360 365 

Phe Thr Arg Gin Lys Phe Gin Lys Val Leu Lys Ser Lys Met Lys Lys 
370 375 380 

Arg Val Val Ser lie Val Glu Ala Asp Pro Leu Pro Asn Asn Ala Val 
385 390 395 400 

lie His Asn Ser Trp lie Asp Pro Lys Arg Asn Lys Lys lie Thr Phe 
405 410 415 

Glu Asp Ser Glu lie Arg Glu Lys Arg Leu Val Pro Gin Val Val Thr 
420 425 430 

Asp 



(50) INFORMATION FOR SEQ ID NO: 49: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)' 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 
GTGAAGCTTG CCTCTGGTGC CTGCAGGAGG 30 

(51) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
GCAGAATTCC CGGTGGCGTG TTGTGGTGCC C 31 

(52) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1209 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDKDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

ATGTTGTGTC CTTCCAAGAC AGATGGCTCA GGGCACTCTG GTAGGATTCA CCAGGAAACT 60 

CATGGAGAAG GGAAAAGGGA CAAGATTAGC AACAGTGAAG GGAGGGAGAA TGGTGGGAGA 120 

GGATTCCAGA TGAACGGTGG GTCGCTGGAG GCTGAGCATG CCAGCAGGAT GTCAGTTCTC 180 

AGAGCAAAGC CCATGTCAAA CAGCCAACGC TTGCTCCTTC TGTCCCCAGG ATCACCTCCT 240 

10 CGCACGGGGA GCATCTCCTA CATCAACATC ATCATGCCTT CGGTGTTCGG CACCATCTGC 300 

CTCCTGGGCA TCATCGGGAA CTCCACGGTC ATCTTCGCGG TCGTGAAGAA GTCCAAGCTG 360 

CACTGGTGCA ACAACGTCCC CGACATCTTC ATCATCAACC TCTCGGTAGT AGATCTCCTC 420 

TTTCTCCTGG GCATGCCCTT CATGATCCAC CAGCTCATGG GCAATGGGGT GTGGCACTTT 480 

GGGGAGACCA TGTGCACCCT CATCACGGCC ATGGATGCCA ATAGTCAGTT CACCAGCACC 540 

15 TACATC.CTGA CCGCCATGGC CATTGACCGC TACCTGGCCA CTGTCCACCC CATCTCTTCC 600 

ACGAAGTTCC GGAAGCCCTC TGTGGCCACC CTGGTGATCT GCCTCCTGTG GGCCCTCTCC 660 

TTCATCAGCA TCACCCCTGT GTGGCTGTAT GCCAGACTCA TCCCCTTCCC AGGAGGTGCA 720 

GTGGGCTGCG GCATACGCCT GCCCAACCCA GACACTGACC TCTACTGGTT CACCCTGTAC 780 

CAGTTTTTCC TGGCCTTTGC CCTGCCTTTT GTGGTCATCA CAGCCGCATA CGTGAGGATC 840 

20 CTGCAGCGCA TGACGTCCTC AGTGGCCCCC GCCTCCCAGC GCAGCATCCG GCTGCGGACA 900 

AAGAGGGTGA CCCGCACAGC CATCGCCATC TGTCTGGTCT TCTTTGTGTG CTGGGCACCC 960 

TACTATGTGC TACAGCTGAC CCAGTTGTCC ATCAGCCGCC CGACCCTCAC CTTTGTCTAC 1020 

TTATACAATG CGGCCATCAG CTTGGGCTAT GCCAACAGCT GCCTCAACCC CTTTGTGTAC 1080 

ATCGTGCTCT GTGAGACGTT CCGCAAACGC TTGGTCCTGT CGGTGAAGCC TGCAGCCCAG 1140 

25 GGGCAGCTTC GCGCTGTCAG CAACGCTCAG ACGGCTGACG AGGAGAGGAC AGAAAGCAAA 1200 

GGCACCTGA 1209 

(53) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 amino acids 
30 (B) TYPE: amino acid 
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(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

5 Met Leu Cys Pro Ser Lys Thr Asp Gly Ser Gly His Ser Gly Arg lie 

1 5 10 15 

His Gin Glu Thr His Gly Glu Gly Lys Arg Asp Lys He Ser Asn Ser 
20 25 30 

Glu Gly Arg Glu Asn Gly Gly Arg Gly Phe Gin Met Asn Gly Gly Ser 
10 35 40 45 

Leu Glu Ala Glu His Ala Ser Arg Met Ser Val Leu Arg Ala Lys Pro 
50 55 60 

Met Ser Asn Ser Gin Arg Leu Leu Leu Leu Ser Pro Gly Ser Pro Pro 
65 70 75 80 

15 Arg Thr Gly Ser He Ser Tyr He Asn He He Met Pro Ser Val Phe 

85 90 95 

Gly Thr He Cys Leu Leu Gly He He Gly Asn Ser Thr Val He Phe 
100 105 110 

Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp 
20 115 120 125 

He Phe He He Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly 
130 135 140 

Met Pro Phe Met He His Gin Leu Met Gly Asn Gly Val Trp His Phe 
145 150 155 160 

25 Gly Glu Thr Met Cys Thr Leu He Thr Ala Met Asp Ala Asn Ser Gin 

165 170 175 

Phe Thr Ser Thr Tyr He Leu Thr Ala Met Ala He Asp Arg Tyr Leu 
180 185 190 

Ala Thr Val His Pro He Ser Ser Thr Lys Phe Arg Lys Pro Ser Val 
30 195 200 205 

Ala Thr Leu Val He Cys Leu Leu Trp Ala Leu Ser Phe He Ser He 
210 215 220 

Thr Pro Val Trp Leu Tyr Ala Arg Leu He Pro Phe Pro Gly Gly Ala 
225 230 235 240 

35 Val Gly Cys Gly He Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp 

245 250 255 
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Phe Thr Leu Tyr Gin Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val 
260 265 270 

He Thr Ala Ala Tyr Val Arg He Leu Gin Arg Met Thr Ser Ser Val 
275 280 235 

5 Ala Pro Ala Ser Gin Arg Ser He Arg Leu Arg Thr Lys Arg Val Thr 

290 295 300 

Arg Thr Ala He Ala He Cys Leu Val Phe Phe Val Cys Tip Ala Pro 
305 310 315 320 

Tyr Tyr Val Leu Gin Leu Thr Gin Leu Ser He Ser Arg Pro Thr Leu 
10 325 330 335 

Thr Phe Val Tyr Leu Tyr Asn Ala Ala He Ser Leu Gly Tyr Ala Asn 
340 . 345 350 

Ser Cys Leu Asn Pro Phe Val Tyr He Val Leu Cys Glu Thr Phe Arg 
355 360 365 

15 Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gin Gly Gin Leu Arg 

370 375 380 

Ala Val Ser Asn Ala Gin Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys 
385 390 395 400 



20 



Gly Thr 

(54) INFORMATION FOR SEQ ID NO: 53: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GGCGGATCCA TGGATGTGAC TTCCCAA 27 
30 (55) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
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GGCGGATCCC TACACGGCAC TGCTGAA 27 
(56) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1128 base pairs 

5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

10 ATGGATGTGA CTTCCCAAGC CCGGGGCGTG GGCCTGGAGA TGTACCCAGG CACCGCGCAC 60 

GCTGCGGCCC CCAACACCAC CTCCCCCGAG CTCAACCTGT CCCACCCGCT CCTGGGCACC 120 

GCCCTGGCCA ATGGGACAGG TGAGCTCTCG GAGCACCAGC AGTACGTGAT CGGCCTGTTC 180 

CTCTCGTGCC TCTACACCAT CTTCCTCTTC CCCATCGGCT TTGTGGGCAA CATCCTGATC 24 0 

CTGGTGGTGA ACATCAGCTT CCGCGAGAAG ATGACCATCC CCGACCTGTA CTTCATCAAC 300 

15 CTGGCGGTGG CGGACCTCAT CCTGGTGGCC GACTCCCTCA TTGAGGTGTT CAACCTGCAC 360 

GAGCGGTACT ACGACATCGC CGTCCTGTGC ACCTTCATGT CGCTCTTCCT GCAGGTCAAC 420 

ATGTACAGCA GCGTCTTCTT CCTCACCTGG ATGAGCTTCG ACCGCTACAT CGCCCTGGCC 480 

AGGGCCATGC GCTGCAGCCT GTTCCGCACC AAGCACCACG CCCGGCTGAG CTGTGGCCTC 540 

ATCTGGATGG CATCCGTGTC AGCCACGCTG GTGCCCTTCA CCGCCGTGCA CCTGCAGCAC 600 

20 ACCGACGAGG CCTGCTTCTG TTTCGCGGAT GTCCGGGAGG TGCAGTGGCT CGAGGTCACG 660 

CTGGGCTTCA TCGTGCCCTT CGCCATCATC GGCCTGTGCT ACTCCCTCAT TGTCCGGGTG 720 

CTGGTCAGGG CGCACCGGCA CCGTGGGCTG CGGCCCCGGC GGCAGAAGGC GCTCCGCATG 780 

ATCCTCGCGG TGGTGCTGGT CTTCTTCGTC TGCTGGCTGC CGGAGAACGT CTTCATCAGC 840 

GTGCACCTCC TGCAGCGGAC GCAGCCTGGG GCCGCTCCCT GCAAGCAGTC TTTCCGCCAT 900 

25 GCCCACCCCC TCACGGGCCA CATTGTCAAC CTCGCCGCCT TCTCCAACAG CTGCCTAAAC 960 

CCCCTCATCT ACAGCTTTCT CGGGGAGACC TTCAGGGACA AGCTGAGGCT GTACATTGAG 1020 

CAGAAAACAA ATTTGCCGGC CCTGAACCGC TTCTGTCACG CTGCCCTGAA GGCCGTCATT 1080 
» 

CCAGACAGCA CCGAGCAGTC GGATGTGAGG TTCAGCAGTG CCGTGTGA 1128 
(57) INFORMATION FOR SEQ ID NO: 56: 
30 (i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 375 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Met Asp Val Thr Ser Gin Ala Arg Gly Val Gly Leu Glu Met Tyr Pro 
15 10 15 

Gly Thr Ala His Ala Ala Ala Pro Asn Thr Thr Ser Pro Glu Leu Asn 
20 25 30 

Leu Ser His Pro Leu Leu Gly Thr Ala Leu Ala Asn Gly Thr Gly Glu 
35 40 45 

Leu Ser Glu His Gin Gin Tyr Val lie Gly Leu Phe Leu Ser Cys Leu 
50 55 60 

Tyr Thr lie Phe Leu Phe Pro lie Gly Phe Val Gly Asn lie Leu lie 
65 70 75 80 

Leu Val Val Asn lie Ser Phe Arg Glu Lys Met Thr He Pro Asp Leu 
85 90 95 

Tyr Phe He Asn Leu Ala Val Ala Asp Leu He Leu Val Ala Asp Ser 
100 105 no 

Leu He Glu Val Phe Asn Leu His Glu Arg Tyr Tyr Asp He Ala Val 
115 120 125 

Leu Cys Thr Phe Met Ser Leu Phe Leu Gin Val Asn Met Tyr Ser Ser 
130 135 140 

Val Phe Phe Leu Thr Trp Met Ser Phe Asp Arg Tyr He Ala Leu Ala 
145 150 155 160 

Arg Ala Met Arg Cys Ser Leu Phe Arg Thr Lys His His Ala Arg Leu 
165 170 175 

Ser Cys Gly Leu He Trp Met Ala Ser Val Ser Ala Thr Leu Val Pro 
180 185 190 

Phe Thr Ala Val His Leu Gin His Thr Asp Glu Ala Cys Phe Cys Phe 
195 200 205 

Ala Asp Val Arg Glu Val Gin Trp Leu Glu Val Thr Leu Gly Phe He 
210 215 220 

Val Pro Phe Ala He He Gly Leu Cys Tyr Ser Leu He Val Arg Val 
225 230 235 240 

Leu Val Arg Ala His Arg His Arg Gly Leu Arg Pro Arg Arg Gin Lys 
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245 250 255 

Ala Leu Arg Met He Leu Ala Val Val Leu Val Phe Phe Val Cys Trp 
260 265 270 

Leu Pro Glu Asn Val Phe He Ser Val His Leu Leu Gin Arg Thr Gin 
5 275 260 285 

Pro Gly Ala Ala Pro Cys Lys Gin Ser Phe Arg His Ala His Pro Leu 
290 295 300 

Thr Gly His He Val Asn Leu Ala Ala Phe Ser Asn Ser Cys Leu Asn 
305 310 315 320 

10 Pro Leu He Tyr Ser Phe Leu Gly Glu Thr Phe Arg Asp Lys Leu Arg 

325 330 335 

Leu Tyr He Glu Gin Lys Thr Asn Leu Pro Ala Leu Asn Arg Phe Cys 
340 345 350 

His Ala Ala Leu Lys Ala Val He Pro Asp Ser Thr Glu Gin Ser Asp 
15 355 360 365 

Val Arg Phe Ser Ser Ala Val 
370 375 

(58) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA (genomic) 

25 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

AAGGAATTCA CGGCCGGGTG ATGCCATTCC C 31 

(59) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 

30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
35 GGTGGATCCA TAAACACGGG CGTTGAGGAC 30 

(60) INFORMATION FOR SEQ ID NO: 59: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 960 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

ATGCCATTCC CAAACTGCTC AGCCCCCAGC ACTGTGGTGG 

CTGGGGCTGG AGTGTGGGCT GGGTCTGCTG GGCAACGCGG 
10 TTCCGGGTCA GGGTGTGGAA GCCGTACGCT GTCTACCTGC 

CTGCTGTTGG CTGCGTGCCT GCCTTTCCTG GCCGCCTTCT 

CATCTGGGCC GTGTGGGCTG CTGGGCCCTG CGCTTCCTGC 

GGGATGGCCT TCCTGGCCGC CGTGGCTTTG GACCGGTACC 

CTTAAGGTCA ACCTGCTGTC TCCTCAGGCG GCCCTGGGGG 
15 CTGATGGTCG CCCTCACCTG CCCGGGCTTG CTCATCTCTG 

AGGTGCCACA GTTTCTACTC CAGGGCAGAC GGCTCCTTCA 

CTCTCCTGCC TTCAGTTTGT CCTCCCCTTT GGCCTCATCG 

ATCAGGGCTC TCCAGAAAAG ACTCCGGGAG CCTGAGAAAC 

CAGGCACTGG TCACCTTGGT GGTGGTGCTG TTTGCTCTGT 
20 GCCAGAGTCC TGATGCACAT CTTCCAGAAT CTGGGGAGCT 

GCTCATACCT CGGATGTCAC GGGCAGCCTC ACCTACCTGC 

GTATACTGCT TCTCCAGCCC CACCTTCAGG AGCTCCTATC 

CGAGGCAAAG GGCAGGCAGC AGAGCCCCCA GATTTCAACC 

(61) INFORMATION FOR SEQ ID NO: 60: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 319 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 
Met Pro Phe Pro Asn Cys Ser Ala Pro Ser Thr Val Val Ala Thr Ala 



CCACAGCTGT GGGTGTCTTG 60 

TGGCGCTGTG GACCTTCCTG 120 

TCAACCTGGC CCTGGCTGAC 180 

ACCTGAGCCT CCAGGCTTGG 240 

TGGACCTCAG CCGCAGCGTG 300 

TCCGTGTGGT CCACCCTCGG 360 

TCTCGGGCCT CGTCTGGCTC 420 

AGGCCGCCCA GAACTCCACC 480 

GCATCATCTG GCAGGAAGCA 540 

TGTTCTGCAA TGCAGGCATC 600 

AGCCCAAGCT TCAGCGGGCC 660 

GCTTTCTGCC CTGCTTCCTG 720 

GCAGGGCCCT TTGTGCAGTG 780 

ACAGTGTCGT CAACCCCGTG 840 

GGAGGGTCTT CCACACCCTC 900 

CCAGAGACTC CTATTCCTGA 960 
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i 5 10 15 

Val Gly Val Leu Leu Gly Leu Glu Cys Gly Leu Gly Leu Leu Gly Asn 
20 25 30 

Ala Val Ala Leu Trp Thr Phe Leu Phe Arg Val Arg Val Trp Lys Pro 
5 35 40 45 

Tyr Ala Val Tyr Leu Leu Asn Leu Ala Leu Ala Asp Leu Leu Leu Ala 
50 55 60 

Ala Cys Leu Pro Phe Leu Ala Ala Phe Tyr Leu Ser Leu Gin Ala Trp 
65 70 75 80 

10 His Leu Gly Arg Val Gly Cys Trp Ala Leu Arg Phe Leu Leu Asp Leu 

85 90 95 

Ser Arg Ser Val Gly Met Ala Phe Leu Ala Ala Val Ala Leu Asp Arg 
100 105 110 

Tyr Leu Arg Val Val His Pro Arg Leu Lys Val Asn Leu Leu Ser Pro 
15 115 120 125 

Gin Ala Ala Leu Gly Val Ser Gly Leu Val Trp Leu Leu Met Val Ala 
130 135 140 

Leu Thr Cys Pro Gly Leu Leu lie Ser Glu Ala Ala Gin Asn Ser Thr 
145 150 155 160 

20 Arg Cys His Ser Phe Tyr Ser Arg Ala Asp Gly Ser Phe Ser lie lie 

165 170 175 

Trp Gin Glu Ala Leu Ser Cys Leu Gin Phe Val Leu Pro Phe Gly Leu 
180 185 190 

lie Val Phe Cys Asn Ala Gly lie lie Arg Ala Leu Gin Lys Arg Leu 
25 195 200 205 

Arg Glu Pro Glu Lys Gin Pro Lys Leu Gin Arg Ala Gin Ala Leu Val 
210 215 220 

Thr Leu Val Val Val Leu Phe Ala Leu Cys Phe Leu Pro Cys Phe Leu 
225 230 235 240 

30 Ala Arg Val Leu Met His lie Phe Gin Asn Leu Gly Ser Cys Arg Ala 

245 250 255 

Leu Cys Ala Val Ala His Thr Ser Asp Val Thr Gly Ser Leu Thr Tyr 
260 265 270 

Leu His Ser Val Val Asn Pro Val Val Tyr Cys Phe Ser Ser Pro Thr 
35 275 280 285 



Phe Arg Ser Ser Tyr Arg Arg Val Phe His Thr Leu Arg Gly Lys Gly 
290 295 300 
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Gin Ala Ala Glu Pro Pro Asp Phe Asn Pro Arg Asp Ser Tyr Ser 
305 310 315 

(62) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

5 (A) LENGTH: 1143 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

10 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

ATGGAGGAAG GTGGTGATTT TGACAACTAC TATGGGGCAG ACAACCAGTC TGAGTGTGAG 60 

TACACAGACT GGAAATCCTC GGGGGCCCTC ATCCCTGCCA TCTACATGTT GGTCTTCCTC 120 

CTGGGCACCA CGGGAAACGG TCTGGTGCTC TGGACCGTGT TTCGGAGCAG CCGGGAGAAG 180 

AGGCGCTCAG CTGATATCTT CATTGCTAGC CTGGCGGTGG CTGACCTGAC CTTCGTGGTG 240 

15 ACGCTGCCCC TGTGGGCTAC CTACACGTAC CGGGACTATG ACTGGCCCTT TGGGACCTTC 300 

TTCTGCAAGC TCAGCAGCTA CCTCATCTTC GTCAACATGT ACGCCAGCGT CTTCTGCCTC 360 

ACCGGCCTCA GCTTCGACCG CTACCTGGCC ATCGTGAGGC CAGTGGCCAA TGCTCGGCTG 420 

AGGCTGCGGG TCAGCGGGGC CGTGGCCACG GCAGTTCTTT GGGTGCTGGC CGCCCTCCTG 480 

GCCATGCCTG TCATGGTGTT ACGCACCACC GGGGACTTGG AGAACACCAC TAAGGTGCAG 540 

20 TGCTACATGG ACTACTCCAT GGTGGCCACT GTGAGCTCAG AGTGGGCCTG GGAGGTGGGC 600 

CTTGGGGTCT CGTCCACCAC CGTGGGCTTT GTGGTGCCCT TCACCATCAT GCTGACCTGT 660 

TACTTCTTCA TCGCCCAAAC CATCGCTGGC CACTTCCGCA AGGAACGCAT CGAGGGCCTG 720 

CGGAAGCGGC GCCGGCTGCT CAGCATCATC GTGGTGCTGG TGGTGACCTT TGCCCTGTGC 78 0 

TGGATGCCCT ACCACCTGGT GAAGACGCTG TACATGCTGG GCAGCCTGCT GCACTGGCCC 840 

25 TGTGACTTTG ACCTCTTCCT CATGAACATC TTCCCCTACT GCACCTGCAT CAGCTACGTC 900 

AACAGCTGCC TCAACCCCTT CCTCTATGCC TTTTTCGACC CCCGCTTCCG CCAGGCCTGC 960 

ACCTCCATGC TCTGCTGTGG CCAGAGCAGG TGCGCAGGCA CCTCCCACAG CAGCAGTGGG 1020 

GAGAAGTCAG CCAGCTACTC TTCGGGGCAC AGCCAGGGGC CCGGCCCCAA CATGGGCAAG 1080 

GGTGGAGAAC AGATGCACGA GAAATCCATC CCCTACAGCC AGGAGACCCT TGTGGTTGAC 1140 

30 TAG 1143 
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(63) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 

Met Glu Glu Gly Gly Asp Phe Asp Asn Tyr Tyr Gly Ala Asp Asn Gin 
1 5 10 15 

Ser Glu Cys Glu Tyr Thr Asp Trp Lys Ser Ser Gly Ala Leu lie Pro 
20 25 30 

Ala lie Tyr Met Leu Val Phe Leu Leu Gly Thr Thr Gly Asn Gly Leu 
35 40 45 

Val Leu Trp Thr Val Phe Arg Ser Ser Arg Glu Lys Arg Arg Ser Ala 
50 55 60 

Asp He Phe He Ala Ser Leu Ala Val Ala Asp Leu Thr Phe Val Val 
65 70 75 80 

Thr Leu Pro Leu Trp Ala Thr Tyr Thr Tyr Arg Asp Tyr Asp Trp Pro 
85 90 ' 95 

Phe Gly Thr Phe Phe Cys Lys Leu Ser Ser Tyr Leu He Phe Val Asn 
100 105 no 

Met Tyr Ala Ser Val Phe Cys Leu Thr Gly Leu Ser Phe Asp Arg Tyr 
115 120 125 

Leu Ala He Val Arg Pro Val Ala Asn Ala Arg Leu Arg Leu Arg Val 
130 135 140 

Ser Gly Ala Val Ala Thr Ala Val Leu Trp Val Leu Ala Ala Leu Leu 
145 150 155 160 

Ala Met Pro Val Met Val Leu Arg Thr Thr Gly Asp Leu Glu Asn Thr 
165 170 175 

Thr Lys Val Gin Cys Tyr Met Asp Tyr Ser Met Val Ala Thr Val Ser 
180 185 190 

Ser Glu Trp Ala Trp Glu Val Gly Leu Gly Val Ser Ser Thr Thr Val 
195 200 205 

Gly Phe Val Val Pro Phe Thr He Met Leu Thr Cys Tyr Phe Phe He 
210 215 220 



Ala Gin Thr He Ala Gly His Phe Arg Lys Glu Arg He Glu Gly Leu 
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225 230 235 240 

Arg Lys Arg Arg Arg Leu Leu Ser lie lie Val Val Leu Val Val Thr 
245 250 255 

Phe Ala Leu Cys Trp Met Pro Tyr His Leu Val Lys Thr Leu Tyr Met 
5 260 265 270 

Leu Gly Ser Leu Leu His Trp Pro Cys Asp Phe Asp Leu Phe Leu Met 
275 280 285 

Asn lie Phe Pro Tyr Cys Thr Cys lie Ser Tyr Val Asn Ser Cys Leu 
290 295 300 

10 Asn Pro Phe Leu Tyr Ala Phe Phe Asp Pro Arg Phe Arg Gin Ala Cys 

305 310 315 320 

Thr Ser Met Leu Cys Cys Gly Gin Ser Arg Cys Ala Gly Thr Ser His 
325 330 335 

Ser Ser Ser Gly Glu Lys Ser Ala Ser Tyr Ser Ser Gly His Ser Gin 
15 340 345 350 

Gly Pro Gly Pro Asn Met Gly Lys Gly Gly Glu Gin Met His Glu Lys 
355 360 365 

Ser He Pro Tyr Ser Gin Glu Thr Leu Val Val Asp 
370 375 380 

20 (64) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
TGAGAATTCT GGTGACTCAC AGCCGGCACA G ,31 
(65) INFORMATION FOR SEQ ID NO: 64: 

30 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
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GCCGGATCCA AGGAAAAGCA GCAATAAAAG G 



31 



10 



15 



20 



25 



(66) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

. (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

ATGAACTACC CGCTAACGCT GGAAATGGAC CTCGAGAACC TGGAGGACCT GTTCTGGGAA 60 

CTGGACAGAT TGGACAACTA TAACGACACC TCCCTGGTGG AAAATCATCT CTGCCCTGCC 120 

ACAGAGGGTC CCCTCATGGC CTCCTTCAAG GCCGTGTTCG TGCCCGTGGC CTACAGCCTC 180 

ATCTTCCTCC TGGGCGTGAT CGGCAACGTC CTGGTGCTGG TGATCCTGGA GCGGCACCGG 240 

CAGACACGCA GTTCCACGGA GACCTTCCTG TTCCACCTGG CCGTGGCCGA CCTCCTGCTG 300 

GTCTTCATCT TGCCCTTTGC CGTGGCCGAG GGCTCTGTGG GCTGGGTCCT GGGGACCTTC 360 

CTCTGCAAAA CTGTGATTGC CCTGCACAAA GTCAACTTCT ACTGCAGCAG CCTGCTCCTG 420 

GCCTGCATCG CCGTGGACCG CTACCTGGCC ATTGTCCACG CCGTCCATGC CTACCGCCAC 480 

CGCCGCCTCC TCTCCATCCA CATCACCTGT GGGACCATCT GGCTGGTGGG CTTCCTCCTT 540 

GCCTTGCCAG AGATTCTCTT CGCCAAAGTC AGCCAAGGCC ATCACAACAA CTCCCTGCCA 600 

CGTTGCACCT TCTCCCAAGA GAACCAAGCA GAAACGCATG CCTGGTTCAC CTCCCGATTC 660 

CTCTACCATG TGGCGGGATT CCTGCTGCCC ATGCTGGTGA TGGGCTGGTG CTACGTGGGG 720 

GTAGTGCACA GGTTGCGCCA GGCCCAGCGG CGCCCTCAGC GGCAGAAGGC AGTCAGGGTG 780 

GCCATCCTGG TGACAAGCAT CTTCTTCCTC TGCTGGTCAC CCTACCACAT CGTCATCTTC 840 

CTGGACACCC TGGCGAGGCT GAAGGCCGTG GACAATACCT GCAAGCTGAA TGGCTCTCTC 900 

CCCGTGGCCA TCACCATGTG TGAGTTCCTG GGCCTGGCCC ACTGCTGCCT CAACCCCATG 960 

CTCTACACTT TCGCCGGCGT GAAGTTCCGC AGTGACCTGT CGCGGCTCCT GACCAAGCTG 1020 

GGCTGTACCG GCCCTGCCTC CCTGTGCCAG CTCTTCCCTA GCTGGCGCAG GAGCAGTCTC 1080 

TCTGAGTCAG AGAATGCCAC CTCTCTCACC ACGTTCTAG 1119 

(67) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 



• 
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(A) LENGTH: 372 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Met Asn Tyr Pro Leu Thr Leu Glu Met Asp Leu Glu Asn Leu Glu Asp 
15 10 is 

Leu Phe Trp Glu Leu Asp Arg Leu Asp Asn Tyr Asn Asp Thr Ser Leu 
20 25 30 

Val Glu Asn His Leu Cys Pro Ala Thr Glu Gly Pro Leu Met Ala Ser 
35 40 45 

Phe Lys Ala Val Phe Val Pro Val Ala Tyr Ser Leu He Phe Leu Leu 
50 55 60 

Gly Val lie Gly Asn Val Leu Val Leu Val He Leu Glu Arg His Arg 
65 70 75 80 

Gin Thr Arg Ser Ser Thr Glu Thr Phe Leu Phe His Leu Ala Val Ala 
85 90 95 

Asp Leu Leu Leu Val Phe He Leu Pro Phe Ala Val Ala Glu Gly Ser 
100 105 no 

Val Gly Trp Val Leu Gly Thr Phe Leu Cys Lys Thr Val He Ala Leu 
115 120 125 

His Lys Val Asn Phe Tyr Cys Ser Ser Leu Leu Leu Ala Cys He Ala 
130 135 140 

Val Asp Arg Tyr Leu Ala He Val His Ala Val His Ala Tyr Arg His 
145 150 155 160 

Arg Arg Leu Leu Ser He His He Thr Cys Gly Thr lie Trp Leu Val 
165 170 175 

Gly Phe Leu Leu Ala Leu Pro Glu He Leu Phe Ala Lys Val Ser Gin 
180 185 190 

Gly His His Asn Asn Ser Leu Pro Arg Cys Thr Phe Ser Gin Glu Asn 
195 200 205 

Gin Ala Glu Thr His Ala Trp Phe Thr Ser Arg Phe Leu Tyr His Val 
210 215 220 

Ala Gly Phe Leu Leu Pro Met Leu Val Met Gly Trp Cys Tyr Val Gly 
225 230 235 240 

Val Val His Arg Leu Arg Gin Ala Gin Arg Arg Pro Gin Arg Gin Lys 
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245 250 255 

Ala Val Arg Val Ala lie Leu Val Thr Ser He Phe Phe Leu Cys Trp 
260 265 270 

Ser Pro Tyr His He Val He Phe Leu Asp Thr Leu Ala Arg Leu Lys 
5 275 280 285 

Ala Val Asp Asn Thr Cys Lys Leu Asn Gly Ser Leu Pro Val Ala He 
290 295 300 

Thr Met Cys Glu Phe Leu Gly Leu Ala His Cys Cys Leu Asn Pro Met 
305 310 315 320 

10 Leu Tyr Thr Phe Ala Gly Val Lys Phe Arg Ser Asp Leu Ser Arg Leu 

325 330 335 

Leu Thr Lys Leu Gly Cys Thr Gly Pro Ala Ser Leu Cys Gin Leu Phe 
340 345 350 

Pro Ser Trp Arg Arg Ser Ser Leu Ser Glu Ser Glu Asn Ala Thr Ser 
15 355 360 365 

Leu Thr Thr Phe 
370 

(68) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

CAAAGCTTGA AAGCTGCACG GTGCAGAGAC 30 

(69) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
35 GCGGATCCCG AGTCACACCC TGGCTGGGCC 30 

(70) INFORMATION FOR SEQ ID NO: 69: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1128 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

ATGGATGTGA CTTCCCAAGC CCGGGGCGTG GGCCTGGAGA TGTACCCAGG CACCGCGCAG 60 

CCTGCGGCCC CCAACACCAC CTCCCCCGAG CTCAACCTGT CCCACCCGCT CCTGGGCACC 120 

GCCCTGGCCA ATGGGACAGG TGAGCTCTCG GAGCACCAGC AGTACGTGAT CGGCCTGTTC 180 

CTCTCGTGCC TCTACACCAT CTTCCTCTTC CCCATCGGCT TTGTGGGCAA CATCCTGATC 240 

CTGGTGGTGA ACATCAGCTT CCGCGAGAAG ATGACCATCC CCGACCTGTA CTTCATCAAC 300 

CTGGCGGTGG CGGACCTCAT CCTGGTGGCC GACTCCCTCA TTGAGGTGTT CAACCTGCAC 360 

GAGCGGTACT ACGACATCGC CGTCCTGTGC ACCTTCATGT CGCTCTTCCT GCAGGTCAAC 420 

ATGTACAGCA GCGTCTTCTT CCTCACCTGG ATGAGCTTCG ACCGCTACAT CGCCCTGGCC 480 

AGGGCCATGC GCTGCAGCCT GTTCCGCACC AAGCACCACG CCCGGCTGAG CTGTGGCCTC 540 

ATCTGGATGG CATCCGTGTC AGCCACGCTG GTGCCCTTCA CCGCCGTGCA CCTGCAGCAC 600 

ACCGACGAGG CCTGCTTCTG TTTCGCGGAT GTCCGGGAGG TGCAGTGGCT CGAGGTCACG 660 

CTGGGCTTCA TCGTGCCCTT CGCCATCATC GGCCTGTGCT ACTCCCTCAT TGTCCGGGTG 720 

CTGGTCAGGG CGCACCGGCA CCGTGGGCTG CGGCCCCGGC GGCAGAAGGC GCTCCGCATG 780 

ATCCTCGCGG TGGTGCTGGT CTTCTTCGTC TGCTGGCTGC CGGAGAACGT CTTCATCAGC 840 

GTGCACCTCC TGCAGCGGAC GCAGCCTGGG GCCGCTCCCT GCAAGCAGTC TTTCCGCCAT 900 

GCCCACCCCC TCACGGGCCA CATTGTCAAC CTCACCGCCT TCTCCAACAG CTGCCTAAAC 960 

CCCCTCATCT ACAGCTTTCT CGGGGAGACC TTCAGGGACA AGCTGAGGCT GTACATTGAG 1020 

CAGAAAACAA ATTTGCCGGC CCTGAACCGC TTCTGTCACG CTGCCCTGAA GGCCGTCATT 1080 

CCAGACAGCA CCGAGCAGTC GGATGTGAGG TTCAGCAGTG CCGTGTAG 1128 
(71) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Met Asp Val Thr Ser Gin Ala Arg Gly Val Gly Leu Glu Met Tyr Pro 
15 10 15 

Gly Thr Ala Gin Pro Ala Ala Pro Asn Thr Thr Ser Pro Glu Leu Asn 
20 25 30 

Leu Ser His Pro Leu Leu Gly Thr Ala Leu Ala Asn Gly Thr Gly Glu 
35 40 45 

Leu Ser Glu His Gin Gin Tyr Val lie Gly Leu Phe Leu Ser Cys Leu 
50 55 60 

Tyr Thr lie Phe Leu Phe Pro lie Gly Phe Val Gly Asn lie Leu lie 
65 70 75 80 

Leu Val Val Asn lie Ser Phe Arg Glu Lys Met Thr He Pro Asp Leu 
85 90 95 

Tyr Phe He Asn Leu Ala Val Ala Asp Leu He Leu Val Ala Asp Ser 
100 105 110 

Leu He Glu Val Phe Asn Leu His Glu Arg Tyr Tyr Asp He Ala Val 
115 120 125 

Leu Cys Thr Phe Met Ser Leu Phe Leu Gin Val Asn Met Tyr Ser Ser 
130 135 140 

Val Phe Phe Leu Thr Trp Met Ser Phe Asp Arg Tyr He Ala Leu Ala 
145 150 155 160 

Arg Ala Met Arg Cys Ser Leu Phe Arg Thr Lys His His Ala Arg Leu 
165 170 175 

Ser Cys Gly Leu He Trp Met Ala Ser Val Ser Ala Thr Leu Val Pro 
180 185 190 

Phe Thr Ala Val His Leu Gin His Thr Asp Glu Ala Cys Phe Cys Phe 
195 200 205 

Ala Asp Val Arg Glu Val Gin Trp Leu Glu Val Thr Leu Gly Phe He 
210 215 220 

Val Pro Phe Ala He He Gly Leu Cys Tyr Ser Leu He Val Arg Val 
225 230 235 240 

Leu Val Arg Ala His Arg His Arg Gly Leu Arg Pro Arg Arg Gin Lys 
245 250 255 

Ala Leu Arg Met He Leu Ala Val Val Leu Val Phe Phe Val Cys Trp 
260 265 270 
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Leu Pro Glu Asn Val Phe lie Ser Val His Leu Leu Gin Arg Thr Gin 
275 280 285 

Pro Gly Ala Ala Pro Cys Lys Gin Ser Phe Arg His Ala His Pro Leu 
29C 295 300 

5 Thr Gly His He Val Asn Leu Thr Ala Phe Ser Asn Ser Cys Leu Asn 

305 310 315 * 320 

Pro Leu He Tyr Ser Phe Leu Gly Glu Thr Phe Arg Asp Lys Leu Arg 
325 330 ~ 335 

Leu Tyr He Glu Gin Lys Thr Asn Leu Pro Ala Leu Asn Arg Phe Cys 
10 340 345 350 

His Ala Ala Leu Lys Ala Val He Pro Asp Ser Thr Glu Gin Ser Asp 
355 360 365 

Val Arg Phe Ser Ser Ala Val 
370 375 

15 (72) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
ACAGAATTCC TGTGTGGTTT TACCGCCCAG 30 

(73) INFORMATION FOR SEQ ID NO: 72: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
CTCGGATCCA GGCAGAAGAG TCGCCTATGG 30 

(74) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1137 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 

ATGGACCTGG GGAAACCAAT GAAAAGCGTG CTGGTGGTGG CTCTCCTTGT CATTTTCCAG 60 

5 GTATGCCTGT GTCAAGATGA GGTCACGGAC GATTACATCG GAGACAACAC CACAGTGGAC 120 

TACACTTTGT TCGAGTCTTT GTGCTCCAAG AAGGACGTGC GGAACTTTAA AGCCTGGTTC 180 

CTCCCTATCA TGTACTCCAT CATTTGTTTC GTGGGCCTAC TGGGCAATGG GCTGGTCGTG 240 

TTGACCTATA TCTATTTCAA GAGGCTCAAG ACCATGACCG ATACCTACCT GCTCAACCTG 300 

GCGGTGGCAG ACATCCTCTT CCTCCTGACC CTTCCCTTCT GGGCCTACAG CGCGGCCAAG 360 

10 TCCTGGGTCT TCGGTGTCCA CTTTTGCAAG CTCATCTTTG CCATCTACAA GATGAGCTTC 420 

TTCAGTGGCA TGCTCCTACT TCTTTGCATC AGCATTGACC GCTACGTGGC CATCGTCCAG 480 

GCTGTCTCAG CTCACCGCCA CCGTGCCCGC GTCCTTCTCA TCAGCAAGCT GTCCTGTGTG 540 

GGCATCTGGA TACTAGCCAC AGTGCTCTCC ATCCCAGAGC TCCTGTACAG TGACCTCCAG 600 

AGGAGCAGCA GTGAGCAAGC GATGCGATGC TCTCTCATCA CAGAGCATGT GGAGGCCTTT 660 

15 ATCACCATCC AGGTGGCCCA GATGGTGATC GGCTTTCTGG TCCCCCTGCT GGCCATGAGC 720 

TTCTGTTACC TTGTCATCAT CCGCACCCTG CTCCAGGCAC GCAACTTTGA GCGCAACAAG 780 

GCCATCAAGG TGATCATCGC TGTGGTCGTG GTCTTCATAG TCTTCCAGCT GCCCTACAAT 840 

GGGGTGGTCC TGGCCCAGAC GGTGGCCAAC TTCAACATCA CCAGTAGCAC CTGTGAGCTC 900 

AGTAAGCAAC TCAACATCGC CTACGACGTC ACCTACAGCC TGGCCTGCGT CCGCTGCTGC 960 

20 GTCAACCCTT TCTTGTACGC CTTCATCGGC GTCAAGTTCC GCAACGATCT CTTCAAGCTC 1020 

TTCAAGGACC TGGGCTGCCT CAGCCAGGAG CAGCTCCGGC AGTGGTCTTC CTGTCGGCAC 1080 

ATCCGGCGCT CCTCCATGAG TGTGGAGGCC GAGACCACCA CCACCTTCTC CCCATAG 1137 
(75) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 378 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
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Met Asp Leu Gly Lys Pro Met Lys Ser Val Leu Val Val Ala Leu Leu 
1 5 10 15 

Val lie Phe Gin Val Cys Leu Cys Gin Asp Glu Val Thr Asp Asp Tyr 
20 25 30 

lie Gly Asp Asn Thr Thr Val Asp Tyr Thr Leu Phe Glu Ser Leu Cys 
35 40 45 

Ser Lys Lys Asp Val Arg Asn Phe Lys Ala Trp Phe Leu Pro lie Met 
50 55 60 

Tyr Ser lie He Cys Phe Val Gly Leu Leu Gly Asn Gly Leu Val Val 
65 70 75 80 

Leu Thr Tyr He Tyr Phe Lys Arg Leu Lys Thr Met Thr Asp Thr Tyr 
85 90 95 

.Leu Leu Asn Leu Ala Val Ala Asp He Leu Phe Leu Leu Thr Leu Pro 
100 105 no 

Phe Trp Ala Tyr Ser Ala Ala Lys Ser Trp Val Phe Gly Val His Phe 
115 120 125 

Cys Lys Leu He Phe Ala lie Tyr Lys Met Ser Phe Phe Ser Gly Met 
130 135 140 

Leu Leu Leu Leu Cys He Ser He Asp Arg Tyr Val Ala He Val Gin 
145 150 155 160 

Ala Val Ser Ala His Arg His Arg Ala Arg Val Leu Leu He Ser Lys 
165 170 175 

Leu Ser Cys Val Gly He Trp He Leu Ala Thr Val Leu Ser He Pro 
180 185 190 

Glu Leu Leu Tyr Ser Asp Leu Gin Arg Ser Ser Ser Glu Gin Ala Met 
195 200 205 

Arg Cys Ser Leu He Thr Glu His Val Glu Ala Phe He Thr lie Gin 
210 215 220 

Val Ala Gin Met Val lie Gly Phe Leu Val Pro Leu Leu Ala Met Ser 
225 230 235 240 

Phe Cys Tyr Leu Val lie lie Arg Thr Leu Leu Gin Ala Arg Asn Phe 
245 250 255 

Glu Arg Asn Lys Ala He Lys Val He He Ala Val Val Val Val Phe 
260 265 270 



lie Val Phe Gin Leu Pro Tyr Asn Gly Val Val Leu Ala Gin Thr Val 
275 280 285 
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Ala Asn Phe Asn He Thr Ser Ser Thr Cys Glu Leu Ser Lys Gin Leu 
290 295 300 

Asn He Ala Tyr Asp Val Thr Tyr Ser Leu Ala Cys Val Arg Cys Cys 
305 310 315 320 

Val Asn Pro Phe Leu Tyr Ala Phe He Gly Val Lys Phe Arg Asn Asp 
325 330 335 

Leu Phe Lys Leu Phe Lys Asp Leu Gly Cys Leu Ser Gin Glu Gin Leu 
340 345 350 

Arg Gin Trp Ser Ser Cys Arg His He Arg Arg Ser Ser Met Ser Val 
355 360 365 

Glu Ala Glu Thr Thr Thr Thr Phe Ser Pro 
370 375 

(76) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CTGGAATTCA CCTGGACCAC CACCAATGGA TA 32 

(77) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
CTCGGATCCT GCAAAGTTTG TCATACAGTT 30 

(78) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1085 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

ATGGATATAC AAATGGCAAA CAATTTTACT CCGCCCTCTG CAACTCCTCA GGGAAATGAC $0 

TGTGACCTCT ATGCACATCA CAGCACGGCC AGGATAGTAA TGCCTCTGCA TTACAGCCTC 120 

GTCTTCATCA TTGGGCTCGT GGGAAACTTA CTAGCCTTGG TCGTCATTGT TCAAAACAGG 180 

AAAAAAATCA ACTCTACCAC CCTCTATTCA ACAAATTTGG TGATTTCTGA TATACTTTTT 240 

ACCACGGCTT TGCCTACACG AATAGCCTAC TATGCAATGG GCTTTGACTG GAGAATCGGA 300 

GATGCCTTGT GTAGGATAAC TGCGCTAGTG TTTTACATCA ACACATATGC AGGTGTGAAC 360 

TTTATGACCT GCCTGAGTAT TGACCGCTTC ATTGCTGTGG TGCACCCTCT ACGCTACAAC 420 

AAGATAAAAA GGATTGAACA TGCAAAAGGC GTGTGCATAT TTGTCTGGAT TCTAGTATTT 480 

GCTCAGACAC TCCCACTCCT CATCAACCCT ATGTCAAAGC AGGAGGCTGA AAGGATTACA 540 

TGCATGGAGT ATCCAAACTT TGAAGAAACT AAATCTCTTC CCTGGATTCT GCTTGGGGCA 600 

TGTTTCATAG GATATGTACT TCCACTTATA ATCATTCTCA TCTGCTATTC TCAGATCTGC 660 

TGCAAACTCT TCAGAACTGC CAAACAAAAC CCACTCACTG AGAAATCTGG TGTAAACAAA 720 

AAGGCTCTCA ACACAATTAT TCTTATTATT GTTGTGTTTG TTCTCTGTTT CACACCTTAC 780 

CATGTTGCAA TTATTCAACA TATGATTAAG AAGCTTCGTT TCTCTAATTT CCTGGAATGT 840 

AGCCAAAGAC ATTCGTTCCA GATTTCTCTG CACTTTACAG TATGCCTGAT GAACTTCAAT 900 

TGCTGCATGG ACCCTTTTAT CTACTTCTTT GCATGTAAAG GGTATAAGAG AAAGGTTATG 960 

AGGATGCTGA AACGGCAAGT CAGTGTATCG ATTTCTAGTG CTGTGAAGTC AGCCCCTGAA 1020 

GAAAATTCAC GTGAAATGAC AGAAACGCAG ATGATGATAC ATTCCAAGTC TTCAAATGGA 1080 

AAGTGA 1086 
(79) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
Met Asp lie Gin Met Ala Asn Asn Phe Thr Pro Pro Ser Ala Thr Pro 



1 



5 



10 



15 
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Gin Gly Asn Asp Cys Asp Leu Tyr Ala His His Ser Thr Ala Arg lie 
20 25 30 

Val Met Pro Leu His Tyr Ser Leu Val Phe He He Gly Leu Val Gly 
35 40 45 

5 Asn Leu Leu Ala Leu Val Val He Val Gin Asn Arg Lys Lys He Asn 

50 55 60 

Ser Thr Thr Leu Tyr Ser Thr Asn Leu Val He Ser Asp He Leu Phe 
65 70 75 80 

Thr Thr Ala Leu Pro Thr Arg He Ala Tyr Tyr Ala Met Gly Phe Asp 
10 85 90 95 

Trp Arg He Gly Asp Ala Leu Cys Arg He Thr Ala Leu Val Phe Tyr 
100 105 110 

He Asn Thr Tyr Ala Gly Val Asn Phe Met Thr Cys Leu Ser He Asp 
115 120 125 

15 Arg Phe He Ala Val Val His Pro Leu Arg Tyr Asn Lys He Lys Arg 

130 135 140 

He Glu His Ala Lys Gly Val Cys He Phe Val Trp He Leu Val Phe 
145 150 155 160 

Ala Gin Thr Leu Pro Leu Leu He Asn Pro Met Ser Lys Gin Glu Ala 
20 165 170 175 

Glu Arg He Thr Cys Met Glu Tyr Pro Asn Phe Glu Glu Thr Lys Ser 
180 185 190 

Leu Pro Trp He Leu Leu Gly Ala Cys Phe He Gly Tyr Val Leu Pro 
195 200 205 

25 Leu He He He Leu lie Cys Tyr Ser Gin He Cys Cys Lys Leu Phe 

210 215 220 

Arg Thr Ala Lys Gin Asn Pro Leu Thr Glu Lys Ser Gly Val Asn Lys 
225 230 235 240 

Lys Ala Leu Asn Thr lie lie Leu He lie Val Val Phe Val Leu Cys 
30 245 250 255 

Phe Thr Pro Tyr His Val Ala lie lie Gin His Met lie Lys Lys Leu 
260 265 270 

Arg Phe Ser Asn Phe Leu Glu Cys Ser Gin Arg His Ser Phe Gin lie 
275 280 285 

35 Ser Leu His Phe Thr Val Cys Leu Met Asn Phe Asn Cys Cys Met Asp 

290 295 300 



Pro Phe He Tyr Phe Phe Ala Cys Lys Gly Tyr Lys Arg Lys Val Met 
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305 310 315 320 

Arg Met Leu Lys Arg Gin Val Ser Val Ser lie Ser Ser Ala Val Lys 
325 330 335 

Ser Ala Pro Glu Glu Asn Ser Arg Glu Met Thr Glu Thr Gin Met Met 
340 345 350 

lie His Ser Lys Ser Ser Asn Gly Lys 
355 360 

(80) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
CTGGAATTCT CCTGCTCATC CAGCCATGCG G 

(81) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: 
CCTGGATCCC CACCCCTACT GGGGCCTCAG 

(82) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
ATGCGGTGGC TGTGGCCCCT GGCTGTCTCT CTTGCTGTGA TTTTGGCTGT GGGGCTAAGC 



AGGGTCTCTG GGGGTGCCCC CCTGCACCTG GGCAGGCACA GAGCCGAGAC CCAGGAGCAG 120 
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CAGAGCCQAT CCAAGAGGGG CACCGAGGAT GAGGAGGCCA AGGGCGTGCA GCAGTATGTG 180 

CCTGAGGAGT GGGCGGAGTA CCCCCGGCCC ATTCACCCTG CTGGCCTGCA GCCAACCAAG 240 

CCCTTGGTGG CCACCAGCCC TAACCCCGAC AAGGATGGGG GCACCCCAGA CAGTGGGCAG 300 

GAACTGAGGG GCAATCTGAC AGGGGCACCA GGGCAGAGGC TACAGATCCA GAACCCCCTG 360 

5 TATCCGGTGA CCGAGAGCTC CTACAGTGCC TATGCCATCA TGCTTCTGGC GCTGGTGGTG 420 

TTTGCGGTGG GCATTGTGGG CAACCTGTCG GTCATGTGCA TCGTGTGGCA CAGCTACTAC 480 

CTGAAGAGCG CCTGGAACTC CATCCTTGCC AGCCTGGCCC TCTGGGATTT TCTGGTCCTC 540 

TTTTTCTGCC TCCCTATTGT CATCTTCAAC GAGATCACCA AGCAGAGGCT ACTGGGTGAC 600 

GTTTCTTGTC GTGCCGTGCC CTTCATGGAG GTCTCCTCTC TGGGAGTCAC GACTTTCAGC 660 

10 CTCTGTGCCC TGGGCATTGA CCGCTTCCAC GTGGCCACCA GCACCCTGCC CAAGGTGAGG 720 

CCCATCGAGC GGTGCCAATC CATCCTGGCC AAGTTGGCTG TCATCTGGGT GGGCTCCATG 780 

ACGCTGGCTG TGCCTGAGCT CCTGCTGTGG CAGCTGGCAC AGGAGCCTGC CCCCACCATG 840 

GGCACCCTGG ACTCATGCAT CATGAAACCC TCAGCCAGCC TGCCCGAGTC CCTGTATTCA 900 

CTGGTGATGA CCTACCAGAA CGCCCGCATG TGGTGGTACT TTGGCTGCTA CTTCTGCCTG 960 

15 CCCATCCTCT TCACAGTCAC CTGCCAGCTG GTGACATGGC GGGTGCGAGG CCCTCCAGGG 1020 

AGGAAGTCAG AGTGCAGGGC CAGCAAGCAC GAGCAGTGTG AGAGCCAGCT CAACAGCACC 1080 

GTGGTGGGCC TGACCGTGGT CTACGCCTTC TGCACCCTCC CAGAGAACGT CTGCAACATC 114 0 

GTGGTGGCCT ACCTCTCCAC CGAGCTGACC CGCCAGACCC TGGACCTCCT GGGCCTCATC 1200 

AACCAGTTCT CCACCTTCTT CAAGGGCGCC ATCACCCCAG TGCTGCTCCT TTGCATCTGC 1260 

20 AGGCCGCTGG GCCAGGCCTT CCTGGACTGC TGCTGCTGCT GCTGCTGTGA GGAGTGCGGC 1320 

GGGGCTTCGG AGGCCTCTGC TGCCAATGGG TCGGACAACA AGCTCAAGAC CGAGGTGTCC 1380 

TCTTCCATCT ACTTCCACAA GCCCAGGGAG TCACCCCCAC TCCTGCCCCT GGGCACACCT 1440 

TGCTGA 1446 
(83) INFORMATION FOR SEQ ID NO: 82: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 481 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

30. (ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Met Arg Trp Leu Trp Pro Leu Ala Val Ser Leu Ala Val lie Leu Ala 
1 5 10 15 

Val Gly Leu Ser Arg Val Ser Gly Gly Ala Pro Leu His Leu Gly Arg 
5 20 25 30 

His Arg Ala Glu Thr Gin Glu Gin Gin Ser Arg Ser Lys Arg Gly Thr 
35 40 45 

Glu Asp Glu Glu Ala Lys Gly Val Gin Gin Tyr Val Pro Glu Glu Trp 
50 55 60 

10 Ala Glu Tyr Pro Arg Pro He His Pro Ala Gly Leu Gin Pro Thr Lys 

65 70 75 80 

Pro Leu Val Ala Thr Ser Pro Asn Pro Asp Lys Asp Gly Gly Thr Pro 
85 90 95 

Asp Ser Gly Gin Glu Leu Arg Gly Asn Leu Thr Gly Ala Pro Gly Gin 
15 100 105 no 

Arg Leu Gin He Gin Asn Pro Leu Tyr Pro Val Thr Glu Ser Ser Tyr 
115 120 125 

Ser Ala Tyr Ala He Met Leu Leu Ala Leu Val Val Phe Ala Val Gly 
130 135 140 

20 He Val Gly Asn Leu Ser Val Met Cys He Val Trp His Ser Tyr Tyr 

14 5 150 155 160 

Leu Lys Ser Ala Trp Asn Ser He Leu Ala Ser Leu Ala Leu Trp Asp 
165 170 175 

Phe Leu Val Leu Phe Phe Cys Leu Pro He Val He Phe Asn Glu He 
25 180 185 190 

Thr Lys Gin Arg Leu Leu Gly Asp Val Ser Cys Arg Ala Val Pro Phe 
195 200 205 

Met Glu Val Ser Ser Leu Gly Val Thr Thr Phe Ser Leu Cys Ala Leu 
210 215 220 

30 Gly He Asp Arg Phe His Val Ala Thr Ser Thr Leu Pro Lys Val Arg 

225 230 235 240 

Pro He Glu Arg Cys Gin Ser He Leu Ala Lys Leu Ala Val He Trp 
245 250 255 

Val Gly Ser Met Thr Leu Ala Val Pro Glu Leu Leu Leu Trp Gin Leu 
35 260 265 270 

Ala Gin Glu Pro Ala Pro Thr Met Gly Thr Leu Asp Ser Cys He Met 
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275 280 285 

Lys Pro Ser Ala Ser Leu Pro Glu Ser Leu Tyr Ser Leu Val Met Thr 
290 295 300 

Tyr Gin Asn Ala Arg Met Trp Trp Tyr Phe Gly Cys Tyr Phe Cys Leu 
5 305 310 315 320 

Pro He Leu Phe Thr Val Thr Cys Gin Leu Val Thr Trp Arg Val Arg 
325 330 335 

Gly Pro Pro Gly Arg Lys Ser Glu Cys Arg Ala Ser Lys His Glu Gin 
340 345 350 

10 Cys Glu Ser Gin Leu Asn Ser Thr Val Val Gly Leu Thr Val Val Tyr 

355 360 365 

Ala Phe Cys Thr Leu Pro Glu Asn Val Cys Asn He Val Val Ala Tyr 
370 375 380 

Leu Ser Thr Glu Leu Thr Arg Gin Thr Leu Asp Leu Leu Gly Leu He 
15 385 390 395 400 

Asn Gin Phe Ser Thr Phe Phe Lys Gly Ala He Thr Pro Val Leu Leu 
405 410 415 

Leu Cys He Cys Arg Pro Leu Gly Gin Ala Phe Leu Asp Cys Cys Cys 
420 425 430 

20 Cys Cys Cys Cys Glu Glu Cys Gly Gly Ala Ser Glu Ala Ser Ala Ala 

435 440 445 

Asn Gly Ser Asp Asn Lys Leu Lys Thr Glu Val Ser Ser Ser He Tyr 
450 455 460 

Phe His Lys Pro Arg Glu . Ser Pro Pro Leu Leu Pro Leu Gly Thr Pro 
25 465 470 475 480 

Cys 



(84) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:83: 



ATGTGGAACG CGACGCCCAG CG 



22 
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(85) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
TCATGTATTA ATACTAGATT CT 

(86) INFORMATION FOR SEQ ID NO:85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
TACCATGTGG AACGCGACGC CCAGCGAAGA GCCGGGGT 

(87) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
CGGAATTCAT GTATTAATAC TAGATTCTGT CCAGGCCCG 

(88) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 



WO 00/22129 PCT/US99/23938 

67 

ATGTGGAACG CGACGCCCAG CGAAGAGCCG GGGTTCAACC TGACACTGGC CGACCTGGAC 60 

TGGGATGCTT CCCCCGGCAA CGACTCGCTG GGCGACGAGC TGCTGCAGCT CTTCCCCGCG 120 

CCGCTGCTGG CGGGCGTCAC AGCCACCTGC GTGGCACTCT TCGTGGTGGG TATCGGTGGC 180 

AACCTGCTCA CCATGCTGGT GGTGTCGCGC TTCCGCGAGC TGCGCACCAC CACCAACCTC 240 

5 TACCTGTCCA GCATGGCCTT CTCCGATCTG CTCATCTTCC TCTGCATGCC CCTGGACCTC 300 

GTTCGCCTCT GGCAGTACCG GCCCTGGAAC TTCGGCGACC TCCTCTGCAA ACTCTTCCAA 360 

TTCGTCAGTG AGAGCTGCAC CTACGCCACG GTGCTCACCA TCACAGCGCT GAGCGTCGAG 420 

CGCTACTTCG CCATCTGCTT CCCACTCCGG GCCAAGGTGG TGGTCACCAA GGGGCGGGTG 480 

AAGCTGGTCA TCTTCGTCAT CTGGGCCGTG GCCTTCTGCA GCGCCGGGCC CATCTTCGTG 540 

10 CTAGTCGGGG TGGAGCACGA GAACGGCACC GACCCTTGGG ACACCAACGA GTGCCGCCCC 600 

ACCGAGTTTG CGGTGCGCTC TGGACTGCTC ACGGTCATGG TGTGGGTGTC CAGCATCTTC 660 

TTCTTCCTTC CTGTCTTCTG TCTCACGGTC CTCTACAGTC TCATCGGCAG GAAGCTGTGG 720 

CGGAGGAGGC GCGGCGATGC TGTCGTGGGT GCCTCGCTCA GGGACCAGAA CCACAAGCAA 780 

ACCGTGAAAA TGCTGGCTGT AGTGGTGTTT GCCTTCATCC TCTGCTGGCT CCCCTTCCAC 840 

15 GTAGGGCGAT ATTTATTTTC CAAATCCTTT GAGCCTGGCT CCTTGGAGAT TGCTCAGATC 900 

AGCCAGTACT GCAACCTCGT GTCCTTTGTC CTCTTCTACC TCAGTGCTGC CATCAACCCC 960 

ATTCTGTACA ACATCATGTC CAAGAAGTAC CGGGTGGCAG TGTTCAGACT TCTGGGATTC 1020 

GAACCCTTCT CCCAGAGAAA GCTCTCCACT CTGAAAGATG AAAGTTCTCG GGCCTGGACA 1080 

GAATCTAGTA TTAATACATG A 1101 

20 (89) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 366 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

25 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Met Trp Asn Ala Thr Pro Ser Glu Glu Pro Gly Phe Asn Leu Thr Leu 
1 5 10 15 

30 Ala Asp Leu Asp Trp Asp Ala Ser Pro Gly Asn Asp Ser Leu Gly Asp 

20 25 30 
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Glu Leu Leu Gin Leu Phe Pro Ala Pro Leu Leu Ala Gly Val Thr Ala 
35 40 45 

Thr Cys Val Ala Leu Phe Val Val Gly lie Ala Gly Asn Leu Leu Thr 
50 55 60 

Met Leu Val Val Ser Arg Phe Arg Glu Leu Arg Thr Thr Thr Asn Leu 
65 70 75 80 

Tyr Leu Ser Ser Met Ala Phe Ser Asp Leu Leu lie Phe Leu Cys Met 
85 90 95 

Pro Leu Asp Leu Val Arg Leu Trp Gin Tyr Arg Pro Trp Asn Phe Gly 
100 105 ~ HO 

Asp Leu Leu Cys Lys Leu Phe Gin Phe Val Ser Glu Ser Cys Thr Tyr 
115 120 125 

Ala Thr Val Leu Thr lie Thr Ala Leu Ser Val Glu Arg Tyr Phe Ala 
130 135 140 

lie Cys Phe Pro Leu Arg Ala Lys Val Val Val Thr Lys Gly Arg Val 
145 150 155 160 

Lys Leu Val lie Phe Val lie Trp Ala Val Ala Phe Cys Ser Ala Gly 
165 170 175 

Pro lie Phe Val Leu Val Gly Val Glu His Glu Asn Gly Thr Asp Pro 
180 185 190 

Trp Asp Thr Asn Glu Cys Arg Pro Thr Glu Phe Ala Val Arg Ser Gly 
195 200 205 

Leu Leu Thr Val Met Val Trp Val Ser Ser lie Phe Phe Phe Leu Pro 
210 215 220 

Val Phe Cys Leu Thr Val Leu Tyr Ser Leu lie Gly Arg Lys Leu Trp 
225 230 235 240 

Arg Arg Arg Arg Gly Asp Ala Val Val Gly Ala Ser Leu Arg Asp Gin 
245 250 255 

Asn His Lys Gin Thr Val Lys Met Leu Ala Val Val Val Phe Ala Phe 
260 265 270 

lie Leu Cys Trp Leu Pro Phe His Val Gly Arg Tyr Leu Phe Ser Lys 
275 280 285 

Ser Phe Glu Pro Gly Ser Leu Glu lie Ala Gin lie Ser Gin Tyr Cys 
290 295 300 



Asn Leu Val Ser Phe Val Leu Phe Tyr Leu Ser Ala Ala lie Asn Pro 
305 310 315 320 



lie Leu Tyr Asn lie Met Ser Lys Lys Tyr Arg Val Ala Val Phe Arg 
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325 



330 



335 



Leu Leu Gly 



Phe Glu Pro Phe Ser 
340 



Gin Arg Lys Leu Ser Thr Leu Lys 
345 350 



Asp Glu Ser 
355 



Ser Arg Ala Tip Thr 
360 



Glu Ser Ser lie Asn Thr 
365 



(90) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
GCAAGCTTGT GCCCTCACCA AGCCATGCGA GCC 33 

(91) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
CGGAATTCAG CAATGAGTTC CGACAGAAGC 30 

(92) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1842 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

ATGCGAGCCC CGGGCGCGCT TCTCGCCCGC ATGTCGCGGC TACTGCTTCT GCTACTGCTC 60 

AAGGTGTCTG CCTCTTCTGC CCTCGGGGTC GCCCCTGCGT CCAGAAACGA AACTTGTCTG 120 

GGGGAGAGCT GTGCACCTAC AGTGATCCAG CGCCGCGGCA GGGACGCCTG GGGACCGGGA 180 

AATTCTGCAA GAGACGTTCT GCGAGCCCGA GCACCCAGGG AGGAGCAGGG GGCAGCGTTT 240 
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CTTGCGGGAC CCTCCTGGGA CCTGCCGGCG 
GGGGCGGAGG CGTCGGCAGC CGGACCCCCG 
AGGTGGAAAG GTGCTCGGGG TCAGGAGCCT 
GCCCTCCAGC TCTTCCTTCA GATCTCAGAG 
5 ATTTCCGGGC GTAGCCAGGA GCAGAGTGTG 
TACTGGCCAA GGAGAGCCGG GAAACTCCAG 
GCCAATGGAC TGGCGGGGCA CGAAGGGTGG 
CAGAATGGAT CCTTGGGTGA AGGAATCCAT 
ACGAACCGGC GTGTGAGACT GAAGAACCCC 

10 GCCTACGCGG TCATGTGTCT GTCCGTGGTG 
GCGGTGATGA GCATCGTGTG CCACAACTAC 
GCCAACCTGG CCTTCTGGGA CTTTCTCATC 
CACGAGCTGA CCAAGAAGTG GCTGCTGGAG 
GAGGTCGCTT CTCTGGGAGT CACCACTTTC 

15 CGTGCTGCCA CCAACGTACA GATGTACTAC 
GCCAAACTTG CTGTTATATG GGTGGGAGCT 
CGCCAGCTGA GCAAGGAGGA TTTGGGGTTT 
ATTAAGATCT CTCCTGATTT ACCAGACACC 
GCGAGACTGT GGTGGTATTT TGGCTGTTAC 

20 TGCTCTCTAG TGACTGCGAG GAAAATCCGC 
AAACGGCAGA TTCAACTAGA GAGTCAGATG 
TATGGATTTT GCATTATTCC TGAAAATATC 
GGGGTTTCAC AGCAGACAAT GGACCTCCTT 
AAGTCCTGTG TCACCCCAGT CCTCCTTTTC 

25 ATGGAGTGCT GCTGCTGTTG CTGTGAGGAA 
GATGACAATG ACAACGAGTA CACCACGGAA 
CGTGAAATGT CCACTTTTGC TTCTGTCGGA 
(93) INFORMATION FOR SEQ ID NO: 9: 



70 

GCCCCGGGCC GTGACCCGGC TGCAGGCAGA 300 

GGACCTCCAA CCAGGCCACC TGGCCCCTGG 360 

TCTGAAACTT TGGGGAGAGG GAACCCCACG 420 

GAGGAAGAGA AGGGTCCCAG AGGCGCTGGC 480 

AAGACAGTCC CCGGAGCCAG CGATCTTTTT 540 

GGTTCCCACC ACAAGCCCCT GTCCAAGACG 600 

ACAATTGCAC TCCCGGGCCG GGCGCTGGCC 660 

GAGCCTGGGG GTCCCCGCCG GGGAAACAGC 720 

TTCTACCCGC TGACCCAGGA GTCCTATGGA 780 

ATCTTCGGGA CCGGCATCAT TGGCAACCTG 840 

TACATGCGGA GCATCTCCAA CTCCCTCTTG 900 

ATCTTCTTCT GCCTTCCGCT GGTCATCTTC 960 

GACTTCTCCT GCAAGATCGT GCCCTATATA 1020 

ACCTTATGTG CTCTGTGCAT AGACCGCTTC 1080 

GAAATGATCG AAAACTGTTC CTCAACAACT 1140 

CTATTGTTAG CACTTCCAGA AGTTGTTCTC 1200 

AGTGGCCGAG CTCCGGCAGA AAGGTGCATT 1260 

ATCTATGTTC TAGCCCTCAC CTACGACAGT 1320 

TTTTGTTTGC CCACGCTTTT CACCATCACC 1380 

AAAGCAGAGA AAGCCTGTAC CCGAGGGAAT 1440 

AACTGTACAG TAGTGGCACT GACCATTTTA 1500 

TGCAACATTG TTACTGCCTA CATGGCTACA 1560 

AATATCATCA GCCAGTTCCT TTTGTTCTTT 1620 

TGTCTCTGCA AACCCTTCAG TCGGGCCTTC 1680 

TGCATTCAGA AGTCTTCAAC GGTGACCAGT 1740 

CTCGAACTCT CGCCTTTCAG TACCATACGC 1800 

ACTCATTGCT GA 1842 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 613 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Met Arg Ala Pro Gly Ala Leu Leu Ala Arg Met Ser Arg Leu Leu Leu 
15 10 15 

Leu Leu Leu Leu Lys Val Ser Ala Ser Ser Ala Leu Gly Val Ala Pro 
20 25 30 

Ala Ser Arg Asn Glu Thr Cys Leu Gly Glu Ser Cys Ala Pro Thr Val 
35 40 45 

He Gin Arg Arg Gly Arg Asp Ala Trp Gly Pro Gly Asn Ser Ala Arg 
50 55 60 

Asp Val Leu Arg Ala Arg Ala Pro Arg Glu Glu Gin Gly Ala Ala Phe 
65 70 75 80 

Leu Ala Gly Pro Ser Trp Asp Leu Pro Ala Ala Pro Gly Arg Asp Pro 
85 90 95 

Ala Ala Gly Arg Gly Ala Glu Ala Ser Ala Ala Gly Pro Pro Gly Pro 
100 105 110 

Pro Thr Arg Pro Pro Gly Pro Trp Arg Trp Lys Gly Ala Arg Gly Gin 
115 120 125 

Glu Pro Ser Glu Thr Leu Gly Arg Gly Asn Pro Thr Ala Leu Gin Leu 
130 135 140 

Phe Leu Gin He Ser Glu Glu Glu Glu Lys Gly Pro Arg Gly Ala Gly 
145 150 155 160 

lie Ser Gly Arg Ser Gin Glu Gin Ser Val Lys Thr Val Pro Gly Ala 
165 170 175 

Ser Asp Leu Phe Tyr Trp Pro Arg Arg Ala Gly Lys Leu Gin Gly Ser 
180 185 190 

His His Lys Pro Leu Ser Lys Thr Ala Asn Gly Leu Ala Gly His Glu 
195 200 205 

Gly Trp Thr He Ala Leu Pro Gly Arg Ala Leu Ala Gin Asn Gly Ser 
210 215 220 



Leu Gly Glu Gly He His Glu Pro Gly Gly Pro Arg Arg Gly Asn Ser 
225 230 235 240 
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Thr Asn Arg Arg Val Arg Leu Lys Asn Pro Phe Tyr Pro Leu Thr Gin 
245 250 255 

Glu Ser Tyr Gly Ala Tyr Ala Val Met Cys Leu Ser Val Val He Phe 
260 265 270 

Gly Thr Gly He He Gly Asn Leu Ala Val Met Ser He Val Cys His 
275 280 285 

Asn Tyr Tyr Met Arg Ser He Ser Asn Ser Leu Leu Ala Asn Leu Ala 
290 295 300 

Phe Trp Asp Phe Leu He He Phe Phe Cys Leu Pro Leu Val He Phe 
305 310 315 320 

His Glu Leu Thr Lys Lys Trp Leu Leu Glu Asp Phe Ser Cys Lys He 
325 330 * 335 

Val Pro Tyr He Glu Val Ala Ser Leu Gly Val Thr Thr Phe Thr Leu 
340 345 350 

Cys Ala Leu Cys He Asp Arg Phe Arg Ala Ala Thr Asn Val Gin Met 
355 360 365 

Tyr Tyr Glu Met He Glu Asn Cys Ser Ser Thr Thr Ala Lys Leu Ala 
370 375 380 

Val He Trp Val Gly Ala Leu Leu Leu Ala Leu Pro Glu Val Val Leu 
385 390 395 400 

Arg Gin Leu Ser Lys Glu Asp Leu Gly Phe Ser Gly Arg Ala Pro Ala 
405 410 415 

Glu Arg Cys He lie Lys He Ser Pro Asp Leu Pro Asp Thr He Tyr 



Cys Tyr Phe Cys Leu Pro Thr Leu Phe Thr He Thr Cys Ser Leu Val 
450 455 460 

Thr Ala Arg Lys He Arg Lys Ala Glu Lys Ala Cys Thr Arg Gly Asn 
465 470 475 480 

Lys Arg Gin He Gin Leu Glu Ser Gin Met Asn Cys Thr Val Val Ala 
485 490 495 

Leu Thr He Leu Tyr Gly Phe Cys He He Pro Glu Asn He Cys Asn 
500 505 510 

He Val Thr Ala Tyr Met Ala Thr Gly Val Ser Gin Gin Thr Met Asp 



420 



425 



430 



Val Leu Ala Leu Thr Tyr Asp Ser 
435 440 



Ala Arg Leu Trp Trp Tyr Phe Gly 
445 



515 



520 



525 



Leu Leu Asn He He Ser Gin Phe 



Leu Leu Phe Phe Lys Ser Cys Val 
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530 



535 



540 



Thr Pro Val Leu Leu 
545 



Phe 
550 



Cys 



Leu Cys Lys Pro Phe Ser Arg Ala Phe 
555 560 



Met Glu Cys Cys Cys 
565 



Cys 



Cys 



Cys Glu Glu Cys lie Gin Lys Ser Ser 
570 575 



Thr Val Thr Ser Asp 
580 



Asp 



Asn 



Asp Asn Glu Tyr Thr Thr Glu Leu Glu 
585 590 



Leu Ser Pro Phe Ser 
595 



Thr 



lie 



Arg Arg Glu Met Ser Thr Phe Ala Ser 
600 605 



Val Gly Thr His Cys 
610 

(94) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
CAGAATTCAG AGAAAAAAAG TGAATATGGT TTTT 34 

(95) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
TTGGATCCCT GGTGCATAAC AATTGAAAGA AT 32 

(96) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1248 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

ATGGTTTTTG CTCACAGAAT GGATAACAGC AAGCCACATT TGATTATTCC TACACTTCTG 60 

GTGCCCCTCC AAAACCGCAG CTGCACTGAA ACAGCCACAC CTCTGCCAAG CCAATACCTG 120 

ATGGAATTAA GTGAGGAGCA CAGTTGGATG AGCAACCAAA CAGACCTTCA CTATGTGCTG 180 

5 AAACCCGGGG AAGTGGCCAC AGCCAGCATC TTCTTTGGGA TTCTGTGGTT GTTTTCTATC 240 

TTCGGCAATT CCCTGGTTTG TTTGGTCATC CATAGGAGTA GGAGGACTCA GTCTACCACC 300 

AACTACTTTG TGGTCTCCAT GGCATGTGCT GACCTTCTCA TCAGCGTTGC CAGCACGCCT 360 

TTCGTCCTGC TCCAGTTCAC CACTGGAAGG TGGACGCTGG GTAGTGCAAC GTGCAAGGTT 420 

GTGCGATATT TTCAATATCT CACTCCAGGT GTCCAGATCT ACGTTCTCCT CTCCATCTGC 480 

10 ATAGACCGGT TCTACACCAT CGTCTATCCT CTGAGCTTCA AGGTGTCCAG AGAAAAAGCC 540 

AAGAAAATGA TTGCGGCATC GTGGATCTTT GATGCAGGCT TTGTGACCCC TGTGCTCTTT 600 

TTCTATGGCT CCAACTGGGA CAGTCATTGT AACTATTTCC TCCCCTCCTC TTGGGAAGGC 660 

ACTGCCTACA CTGTCATCCA CTTCTTGGTG GGCTTTGTGA TTCCATCTGT CCTCATAATT 720 

TTATTTTACC AAAAGGTCAT AAAATATATT TGGAGAATAG GCACAGATGG CCGAACGGTG 780 

15 AGGAGGACAA TGAACATTGT CCCTCGGACA AAAGTGAAAA CTATCAAGAT GTTCCTCATT 840 

TTAAATCTGT TGTTTTTGCT CTCCTGGCTG CCTTTTCATG TAGCTCAGCT ATGGCACCCC 900 

CATGAACAAG ACTATAAGAA AAGTTCCCTT GTTTTCACAG CTATCACATG GATATCCTTT 960 

AGTTCTTCAG CCTCTAAACC TACTCTGTAT TCAATTTATA ATGCCAATTT TCGGAGAGGG 1020 

ATGAAAGAGA CTTTTTGCAT GTCCTCTATG AAATGTTACC GAAGCAATGC CTATACTATC 1080 

20 ACAACAAGTT CAAGGATGGC CAAAAAAAAC TACGTTGGCA TTTCAGAAAT CCCTTCCATG 1140 

GCCAAAACTA TTACCAAAGA CTCGATCTAT GACTCATTTG ACAGAGAAGC CAAGGAAAAA 1200 

AAGCTTGCTT GGCCCATTAA CTCAAATCCA CCAAATACTT TTGTCTAA 1248 
(97) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 415 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 



WO 00/22129 PCI7US99/23938 

75 



Met Val Phe Ala His Arg Met Asp Asn Ser Lys Pro His Leu He He 
15 10 15 

Pro Thr Leu Leu Val Pro Leu Gin Asn Arg Ser Cys Thr Glu Thr Ala 
20 25 30 

5 Thr Pro Leu Pro Ser Gin Tyr Leu Met Glu Leu Ser Glu Glu His Ser 

35 40 45 

Trp Met Ser Asn Gin Thr Asp Leu His Tyr Val Leu Lys Pro Gly Glu 
50 55 60 

Val Ala Thr Ala Ser He Phe Phe Gly He Leu Trp Leu Phe Ser He 
10 65 70 75 80 

Phe Gly Asn Ser Leu Val Cys Leu Val He His Arg Ser Arg Arg Thr 
85 90 95 

Gin Ser Thr Thr Asn Tyr Phe Val Val Ser Met Ala Cys Ala Asp Leu 
100 105 110 

15 Leu He Ser Val Ala Ser Thr Pro Phe Val Leu Leu Gin Phe Thr Thr 

115 120 125 

Gly Arg Trp Thr Leu Gly Ser Ala Thr Cys Lys Val Val Arg Tyr Phe 
130 135 140 

Gin Tyr Leu Thr Pro Gly Val Gin He Tyr Val Leu Leu Ser He Cys 
20 145 150 155 160 

He Asp Arg Phe Tyr Thr He Val Tyr Pro Leu Ser Phe Lys Val Ser 
165 170 175 

Arg Glu Lys Ala Lys Lys Met He Ala Ala Ser Trp He Phe Asp Ala 
180 185 190 

25 Gly Phe Val Thr Pro Val Leu Phe Phe Tyr Gly Ser Asn Trp Asp Ser 

195 200 205 

His Cys Asn Tyr Phe Leu Pro Ser Ser Trp Glu Gly Thr Ala Tyr Thr 
210 215 220 

Val He His Phe Leu Val Gly Phe Val He Pro Ser Val Leu He He 
30 225 230 235 240 

Leu Phe Tyr Gin Lys Val He Lys Tyr He Trp Arg He Gly Thr Asp 
245 250 255 

Gly Arg Thr Val Arg Arg Thr Met Asn He Val Pro Arg Thr Lys Val 
260 265 270 



Lys Thr He Lys Met Phe Leu He 
275 280 



Leu Asn Leu Leu Phe Leu Leu Ser 
285 
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Trp Leu Pro Phe His Val Ala Gin Leu Trp His Pro His Glu Gin Asp 
290 295 300 

Tyr Lys Lys Ser Ser Leu Val Phe Thr Ala lie Thr Trp lie Ser Phe 
305 310 315 320 

5 Ser Ser Ser Ala Ser Lys Pro Thr Leu Tyr Ser lie Tyr Asn Ala Asn 

325 330 335 

Phe Arg Arg Gly Met Lys Glu Thr Phe Cys Met Ser Ser Met Lys Cys 
340 345 350 

Tyr Arg Ser Asn Ala Tyr Thr lie Thr Thr Ser Ser Arg Met Ala Lys 
10 355 360 365 

Lys Asn Tyr Val Gly He Ser Glu He Pro Ser Met Ala Lys Thr He 
370 375 380 

Thr Lys Asp Ser He Tyr Asp Ser Phe Asp Arg Glu Ala Lys Glu Lys 
385 390 395 400 

15 Lys Leu Ala Trp Pro He Asn Ser Asn Pro Pro Asn Thr Phe Val 

405 410 415 

(98) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 

20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
25 GGAAAGCTTA ACGATCCCCA GGAGCAACAT 30 

(99) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
CTGGGATCCT ACGAGAGCAT TTTTCACACA G 31 
35 (100) INFORMATION FOR SEQ ID NO: 99: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1842 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 





ATGGGGCCCA 


CCCTAGCGGT TCCCACCCCC TATGGCTGTA TTGGCTGTAA 


GCTACCCCAG 


60 




CCAGAATACC 


CACCGGCTCT AATCATCTTT ATGTTCTGCG CGATGGTTAT 


CACCATCGTT 


120 




GTAGACCTAA 


TCGGCAACTC CATGGTCATT TTGGCTGTGA CGAAGAACAA 


GAAGCTCCGG 


180 


10 


AATTCTGGCA ACATCTTCGT GGTCAGTCTC TCTGTGGCCG ATATGCTGGT GGCCATCTAC 


240 




CCATACCCTT 


TGATGCTGCA TGCCATGTCC ATTGGGGGCT GGGATCTGAG 


CCAGTTACAG 


300 




TGCCAGATGG 


TCGGGTTCAT CACAGGGCTG AGTGTGGTCG GCTCCATCTT 


CAACATCGTG 


360 




GCAATCGCTA 


TCAACCGTTA CTGCTACATC TGCCACAGCC TCCAGTACGA 


ACGGATCTTC 


420 




AGTGTGCGCA 


ATACCTGCAT CTACCTGGTC ATCACCTGGA TCATGACCGT 


CCTGGCTGTC 


480 


15 


CTGCCCAACA 


TGTACATTGG CACCATCGAG TACGATCCTC GCACCTACAC 


CTGCATCTTC. 


540 




AACTATCTGA ACAACCCTGT CTTCACTGTT ACCATCGTCT GCATCCACTT 


CGTCCTCCCT 


600 




CTCCTCATCG 


TGGGTTTCTG CTACGTGAGG ATCTGGACCA AAGTGCTGGC 


GGCCCGTGAC 


660 




CCTGCAGGGC 


AGAATCCTGA CAACCAACTT GCTGAGGTTC GCAATTTTCT 


AACCATGTTT 


720 




GTGATCTTCC 


TCCTCTTTGC AGTGTGCTGG TGCCCTATCA ACGTGCTCAC 


TGTCTTGGTG 


780 


20 


GCTGTCAGTC 


CGAAGGAGAT GGCAGGCAAG ATCCCCAACT GGCTTTATCT 


TGCAGCCTAC 


840 




TTCATAGCCT 


ACTTCAACAG CTGCCTCAAC GCTGTGATCT ACGGGCTCCT 


CAATGAGAAT 


900 




TTCCGAAGAG 


AATACTGGAC CATCTTCCAT GCTATGCGGC ACCCTATCAT 


ATTCTTCCCT 


960 




GGCCTCATCA 


GTGATATTCG TGAGATGCAG GAGGCCCGTA CCCTGGCCCG 


CGCCCGTGCC 


1020 




CATGCTCGCG 


ACCAAGCTCG TGAACAAGAC CGTGCCCATG CCTGTCCTGC 


TGTGGAGGAA 


1080 


25 


ACCCCGATGA ATGTCCGGAA TGTTCCATTA CCTGGTGATG CTGCAGCTGG CCACCCCGAC 


1140 




CGTGCCTCTG 


GCCACCCTAA GCCCCATTCC AGATCCTCCT CTGCCTATCG 


CAAATCTGCC 


1200 




TCTACCCACC ACAAGTCTGT CTTTAGCCAC TCCAAGGCTG CCTCTGGTCA 


CCTCAAGCCT 


1260 




GTCTCTGGCC 


ACTCCAAGCC TGCCTCTGGT CACCCCAAGT CTGCCACTGT 


CTACCCTAAG 


1320 




CCTGCCTCTG 


TCCATTTCAA GGGTGACTCT GTCCATTTCA AGGGTGACTC 


TGTCCATTTC 


1380 



WO 00/22129 PCT/US99/23938 

78 

AAGCCTGACT CTGTTCATTT CAAGCCTGCT TCCAGCAACC CCAAGCCCAT CACTGGCCAC 1440 

CATGTCTCTG CTGGCAGCCA CTCCAAGTCT GCCTTCAGTG CTGCCACCAG CCACCCTAAA 1500 

CCCATCAAGC CAGCTACCAG CCATGCTGAG CCCACCACTG CTGACTATCC CAAGCCTGCC 15C0 

ACTACCAGCC ACCCTAAGCC CGCTGCTGCT GACAACCCTG AGCTCTCTGC CTCCCATTGC 1620 

5 CCCGAGATCC CTGCCATTGC CCACCCTGTG TCTGACGACA GTGACCTCCC TGAGTCGGCC 1680 

TCTAGCCCTG CCGCTGGGCC CACCAAGCCT GCTGCCAGCC AGCTGGAGTC TGACACCATC 1740 

GCTGACCTTC CTGACCCTAC TGTAGTCACT ACCAGTACCA ATGATTACCA TGATGTCGTG 1800 

GTTGTTGATG TTGAAGATGA TCCTGATGAA ATGGCTGTGT GA 1842 
(101) INFORMATION FOR SEQ ID NO: 100: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 613 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

15 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:100: 

Met Gly Pro Thr Leu Ala Val Pro Thr Pro Tyr Gly Cys lie Gly Cys 
1 5 10 15 

Lys Leu Pro Gin Pro Glu Tyr Pro Pro Ala Leu lie lie Phe Met Phe 
20 20 25 30 

Cys Ala Met Val He Thr He Val Val Asp Leu He Gly Asn Ser Met 
35 40 45 

Val He Leu Ala Val Thr Lys Asn Lys Lys Leu Arg Asn Ser Gly Asn 
50 55 60 

25 He Phe Val Val Ser Leu Ser Val Ala Asp Met Leu Val Ala He -Tyr 

65 70 75 80 

Pro Tyr Pro Leu Met Leu His Ala Met Ser He Gly Gly Trp Asp Leu 
85 90 95 

Ser Gin Leu Gin Cys Gin Met Val Gly Phe He Thr Gly Leu Ser Val 
30 100 105 110 

Val Gly Ser He Phe Asn He Val Ala He Ala He Asn Arg Tyr Cys 
115 120 125 

Tyr He Cys His Ser Leu Gin Tyr Glu Arg He Phe Ser Val Arg Asn 
130 135 140 
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Thr Cys lie Tyr Leu Val He Thr Trp He Met Thr Val Leu Ala Val 
145 150 155 160 

Leu Pro Asn Met Tyr He Gly Thr He Glu Tyr Asp Pro Arg Thr Tyr 
165 170 175 

5 Thr Cys lie Phe Asn Tyr Leu Asn Asn Pro Val Phe Thr Val Thr He 

180 185 190 

Val Cys lie His Phe Val Leu Pro Leu Leu He Val Gly Phe Cys Tyr 
195 200 205 

Val Arg lie Trp Thr Lys Val Leu Ala Ala Arg Asp Pro Ala Gly Gin 
10 210 215 220 

Asn Pro Asp Asn Gin Leu Ala Glu Val Arg Asn Phe Leu Thr Met Phe 
225 230 235 240 

Val He Phe Leu Leu Phe Ala Val Cys Trp Cys Pro He Asn Val Leu 
245 250 255 

15 Thr Val Leu Val Ala Val Ser Pro Lys Glu Met Ala Gly Lys He Pro 

260 265 270 

Asn Trp Leu Tyr Leu Ala Ala Tyr Phe lie Ala Tyr Phe Asn Ser Cys 
275 280 285 

Leu Asn Ala Val lie Tyr Gly Leu Leu Asn Glu Asn Phe Arg Arg Glu 
20 290 295 300 

Tyr Trp Thr lie Phe His Ala Met Arg His Pro lie lie Phe Phe Pro 
305 310 315 320 

Gly Leu lie Ser Asp He Arg Glu Met Gin Glu Ala Arg Thr Leu Ala 
325 330 335 

25 Arg -Ala Arg Ala His Ala Arg Asp Gin Ala Arg Glu Gin Asp Arg Ala 

340 345 350 

His Ala Cys Pro Ala Val Glu Glu Thr Pro Met Asn Val Arg Asn Val 
355 360 365 

Pro Leu Pro Gly Asp Ala Ala Ala Gly His Pro Asp Arg Ala Ser Gly 
30 370 375 380 

His Pro Lys Pro His Ser Arg Ser Ser Ser Ala Tyr Arg Lys Ser Ala 
385 390 395 400 

Ser Thr His His Lys Ser Val Phe Ser His Ser Lys Ala Ala Ser Gly 
405 410 415 

35 His Leu Lys Pro Val Ser Gly His Ser Lys Pro Ala Ser Gly His Pro 

420 425 430 



Lys Ser Ala Thr Val Tyr Pro Lys Pro Ala 



Ser Val His Phe Lys 



Gly 



WO 00/22129 PCT/US99/23938 

80 

435 440 445 

Asp Ser Val His Phe Lys Gly Asp Ser Val His Phe Lys Pro Asp Ser 
450 455 460 

Val His Phe Lys Pro Ala Ser Ser Asn Pro Lys Pro lie Thr Gly His 
465 470 475 480 

His Val Ser Ala Gly Ser His Ser Lys Ser Ala Phe Ser Ala Ala Thr 
485 490 495 

Ser His Pro Lys Pro lie Lys Pro Ala Thr Ser His Ala Glu Pro Thr 
500 505 510 

Thr Ala Asp Tyr Pro Lys Pro Ala Thr Thr Ser His Pro Lys Pro Ala 
515 520 525 

Ala Ala Asp Asn Pro Glu Leu Ser Ala Ser His Cys Pro Glu lie Pro 
530 535 540 

Ala lie Ala His Pro Val Ser Asp Asp Ser Asp Leu Pro Glu Ser Ala 
545 550 555 560 

Ser Ser Pro Ala Ala Gly Pro Thr Lys Pro Ala Ala Ser Gin Leu Glu 
565 570 575 

Ser Asp Thr He Ala Asp Leu Pro Asp Pro Thr Val Val Thr Thr Ser 
580 585 590 

Thr Asn Asp Tyr His Asp Val Val Val Val Asp Val Glu Asp Asp Pro 
595 600 605 

Asp Glu Met Ala Val 
610 



(102) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 



TCCAAGCTTC GCCATGGGAC ATAACGGGAG CT 
(103) INFORMATION FOR SEQ ID NO: 102: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
CGTGAATTCC AAGAATTTAC AATCCTTGCT 
5 (104) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1548 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 





ATGGGACATA 


ACGGGAGCTG 


GATCTCTCCA AATGCCAGCG AGCCGCACAA CGCGTCCGGC 


60 




GCCGAGGCTG 


CGGGTGTGAA 


CCGCAGCGCG CTCGGGGAGT TCGGCGAGGC GCAGCTGTAC 


120 


15 


CGCCAGTTCA 


CCACCACCGT 


GCAGGTCGTC ATCTTCATAG GCTCGCTGCT CGGAAACTTC 


180 




ATGGTGTTAT 


GGTCAACTTG 


CCGCACAACC GTGTTCAAAT CTGTCACCAA CAGGTTCATT 


240 




AAAAACCTGG 


CCTGCTCGGG 


GATTTGTGCC AGCCTGGTCT GTGTGCCCTT CGACATCATC 


300 




CTCAGCACCA 


GTCCTCACTG 


TTGCTGGTGG ATCTACACCA TGCTCTTCTG CAAGGTCGTC 


360 




AAATTTTTGC 


ACAAAGTATT 


CTGCTCTGTG ACCATCCTCA GCTTCCCTGC TATTGCTTTG 


420 


20 


GACAGGTACT 


ACTCAGTCCT 


CTATCCACTG GAGAGGAAAA TATCTGATGC CAAGTCCCGT 


480 




GAACTGGTGA 


TGTACATCTG 


GGCCCATGCA GTGGTGGCCA GTGTCCCTGT GTTTGCAGTA 


540 




ACCAATGTGG 


CTGACATCTA 


TGCCACGTCC ACCTGCACGG AAGTCTGGAG CAACTCCTTG 


600 




GGCCACCTGG 


TGTACGTTCT 


GGTGTATAAC ATCACCACGG TCATTGTGCC TGTGGTGGTG 


660 




GTGTTCCTCT 


TCTTGATACT 


GATCCGACGG GCCCTGAGTG CCAGCCAGAA GAAGAAGGTC 


720 


25 


ATCATAGCAG 


CGCTCCGGAC 


CCCACAGAAC ACCATCTCTA TTCCCTATGC CTCCCAGCGG 


780 




GAGGCCGAGC 


TGCACGCCAC 


CCTGCTCTCC ATGGTGATGG TCTTCATCTT GTGTAGCGTG 


840 




CCCTATGCCA 


CCCTGGTCGT 


CTACCAGACT GTGCTCAATG TCCCTGACAC TTCCGTCTTC 


900 




TTGCTGCTCA 


CTGCTGTTTG 


GCTGCCCAAA GTCTCCCTGC TGGCAAACCC TGTTCTCTTT 


960 




CTTACTGTGA ACAAATCTGT 


CCGCAAGTGC TTGATAGGGA CCCTGGTGCA ACTACACCAC 


1020 


30 


CGGTACAGTC 


GCCGTAATGT 


GGTCAGTACA GGGAGTGGCA TGGCTGAGGC CAGCCTGGAA 


1080 
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CCCAGCATAC GCTCGGGTAG CCAGCTCCTG GAGATGTTCC ACATTGGGCA GCAGCAGATC 1140 

TTTAAGCCCA CAGAGGATGA GGAAGAGAGT GAGGCCAAGT ACATTGGCTC AGCTGACTTC 1200 

CAGGCCAAGG AGATATTTAG CACCTGCCTC GAGGGAGAGC AGGGGCCACA GTTTGCGCCC 1260 

TCTGCCCCAC CCCTGAGCAC AGTGGACTCT GTATCCCAGG TGGCACCGGC AGCCCCTGTG 1320 

5 GAACCTGAAA CATTCCCTGA TAAGTATTCC CTGCAGTTTG GCTTTGGGCC TTTTGAGTTG 1380 

CCTCCTCAGT GGCTCTCAGA GACCCGAAAC AGCAAGAAGC GGCTGCTTCC CCCCTTGGGC 1440 

AACACCCCAG AAGAGCTGAT CCAGACAAAG GTGCCCAAGG TAGGCAGGGT GGAGCGGAAG 1500 

ATGAGCAGAA ACAATAAAGT GAGCATTTTT CCAAAGGTGG ATTCCTAG 1548 
(105) INFORMATION FOR SEQ ID NO:104: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 515 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

15 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

Met Gly His Asn Gly Ser Trp lie Ser Pro Asn Ala Ser Glu Pro His 
1.5 10 15 

Asn Ala Ser Gly Ala Glu Ala Ala Gly Val Asn Arg Ser Ala Leu Gly 
20 20 2 5 ~ 30 

Glu Phe Gly Glu Ala Gin Leu Tyr Arg Gin Phe Thr Thr Thr Val Gin 
35 40 45 

Val Val lie Phe He Gly Ser Leu Leu Gly Asn Phe Met Val Leu Trp 
50 55 60 

25 Ser Thr Cys Arg Thr Thr Val Phe Lys Ser Val Thr Asn Arg Phe He 

65 70 75 80 

Lys Asn Leu Ala Cys Ser Gly He Cys Ala Ser Leu Val Cys Val Pro 
85 90 95 

Phe Asp He He Leu Ser Thr Ser Pro His Cys Cys Trp Trp He Tyr 
30 100 105 no 

Thr Met Leu Phe Cys Lys Val Val Lys Phe Leu His Lys Val Phe Cys 
115 120 125 

Ser Val Thr He Leu Ser Phe Pro Ala He Ala Leu Asp Arg Tyr Tyr 
130 135 140 
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Ser Val Leu Tyr Pro Leu Glu Arg Lys lie Ser Asp Ala Lys Ser Arg 
145 150 155 160 

Glu Leu Val Met Tyr lie Trp Ala His Ala Val Val Ala Ser Val Pro 
165 170 175 

Val Phe Ala Val Thr Asn Val Ala Asp lie Tyr Ala Thr Ser Thr Cys 
180 185 190 

Thr Glu Val Trp Ser Asn Ser Leu Gly His Leu Val Tyr Val Leu Val 
195 200 205 

Tyr Asn lie Thr Thr Val He Val Pro Val Val Val Val Phe Leu Phe 
210 215 220 

Leu He Leu He Arg Arg Ala Leu Ser Ala Ser Gin Lys Lys Lys Val 
225 230 235 240 

He He Ala Ala Leu Arg Thr Pro Gin Asn Thr He Ser He Pro Tyr 
245 250 255 

Ala Ser Gin Arg Glu Ala Glu Leu His Ala Thr Leu Leu Ser Met Val 
260 265 270 

Met Val Phe He Leu Cys Ser Val Pro Tyr Ala Thr Leu Val Val Tyr 
275 280 285 

Gin Thr Val Leu Asn Val Pro Asp Thr Ser Val Phe Leu Leu Leu Thr 
290 295 300 

Ala Val Trp Leu Pro Lys Val Ser Leu Leu Ala Asn Pro Val Leu Phe 
305 310 315 320 

Leu Thr Val Asn Lys Ser Val Arg Lys Cys Leu He Gly Thr Leu Val 
325 330 335 

Gin Leu His His Arg Tyr Ser Arg Arg Asn Val Val Ser Thr Gly Ser 
340 345 350 

Gly Met Ala Glu Ala Ser Leu Glu Pro Ser He Arg Ser Gly Ser Gin 
355 360 365 

Leu Leu Glu Met Phe His He Gly Gin Gin Gin He Phe Lys Pro Thr 
370 375 380 

Glu Asp Glu Glu Glu Ser Glu Ala Lys Tyr He Gly Ser Ala Asp Phe 
385 390 395 400 

Gin Ala Lys Glu He Phe Ser Thr Cys Leu Glu Gly Glu Gin Gly Pro 
405 410 415 



Gin Phe Ala Pro Ser Ala Pro Pro Leu Ser Thr Val Asp Ser Val Ser 
420 425 430 



Gin Val Ala Pro Ala Ala Pro Val Glu Pro Glu Thr Phe Pro Asp Lys 
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435 440 445 

Tyr Ser Leu Gin Phe Gly Phe Gly Pro Phe Glu Leu Pro Pro Gin Trp 
450 455 460 

Leu Ser Glu Thr Arg Asn Ser Lys Lys Arg Leu Leu Pro Pro Leu Gly 
5 465 470 475 480 

Asn Thr Pro Glu Glu Leu He Gin Thr Lys Val Pro Lys Val Gly Arg 
485 490 495 

Val Glu Arg Lys Met Ser Arg Asn Asn Lys Val Ser lie Phe Pro Lys 
500 505 sio 

10 val Asp Ser 

515 

(106) INFORMATION FOR SEQ ID NO:105: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 

15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
20 GGAGAATTCA CTAGGCGAGG CGCTCCATC 29 

(107) INFORMATION FOR SEQ ID NO:106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:106: 
GGAGGATCCA GGAAACCTTA GGCCGAGTCC 30 
30 (108) INFORMATION FOR SEQ ID NO:107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1164 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

ATGAATCGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTGTG 60 

TTCCGAGATG ACTTCATTGC CAAGGTGTTG CCGCCGGTGT TGGGGCTGGA GTTTATCTTT 120 

GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 180 

5 TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTAGCTGACT TTCTACTGAT CATCTGCCTG 240 

CCGTTCGTGA TGGACTACTA TGTGCGGCGT TCAGACTGGA ACTTTGGGGA CATCCCTTGC 300 

CGGCTGGTGC TCTTCATGTT TGCCATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 360 

GTGGCGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 420 

AATTGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTGTTGG CCTAACAGTC 480 

10 CACCTCCTGA AGAAGAAGTT GCTGATCCAG AATGGCCCTG CAAATGTGTG CATCAGCTTC 540 

AGCATCTGCC ATACCTTCCG GTGGCACGAA GCTATGTTCC TCCTGGAGTT CCTCCTGCCC 600 

CTGGGCATCA TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCTGCGGCA GAGACAAATG 660 

GACCGGCATG CCAAGATCAA GAGAGCCATC ACCTTCATCA TGGTGGTGGC CATCGTCTTT 720 

GTCATCTGCT TCCTTCCCAG CGTGGTTGTG CGGATCCGCA TCTTCTGGCT CCTGCACACT 780 

15 TCGGGCACGC AGAATTGTGA AGTGTACCGC TCGGTGGACC TGGCGTTCTT TATCACTCTC 840 

AGCTTCACCT ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 900 

TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAGGAAGAT GACAGGTGAG 960 

CCAGATAATA ACCGCAGCAC GAGCGTCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1020 

GCTCCAGAGG CGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 1080 

20 ACCTCAAATA ACCATTCCAA GAAGGGACAT TGTCACCAAG AACCAGCATC TCTGGAGAAA 1140 

CAGTTGGGCT GTTGCATCGA GTAA 1164 
(109) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 387 amino acids 

25 (B) TYPE: amino acid 

(C) STRAND EDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
30 Met Asn Arg His His Leu Gin Asp His Phe Leu Glu lie Asp Lys Lys 
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10 



15 



Asn Cys Cys Val Phe Arg Asp Asp Phe He Ala Lys Val Leu Pro Pro 
20 25 30 

Val Leu Gly Leu Glu Phe He Phe Gly Leu Leu Gly Asn Gly Leu Ala 
35 40 45 

Leu Trp He Phe Cys Phe His Leu Lys Ser Trp Lys Ser Ser Arg He 
50 55 60 



10 



Phe Leu Phe Asn Leu Ala Val Ala Asp Phe Leu Leu He He Cys Leu 
65 70 75 80 

Pro Phe Val Met Asp Tyr Tyr Val Arg Arg Ser Asp Trp Asn Phe Gly 
85 90 95 



Asp He Pro Cys Arg Leu Val Leu Phe Met Phe Ala Met Asn Arg Gin 
100 105 no 

Gly Ser He He Phe Leu Thr Val Val Ala Val Asp Arg Tyr Phe Arg 
15 115 120 125 

Val Val His Pro His His Ala Leu Asn Lys He Ser Asn Trp Thr Ala 
130 135 140 



20 



Ala He He Ser Cys Leu Leu Trp Gly He Thr Val Gly Leu Thr Val 
145 150 155 160 

His Leu Leu Lys Lys Lys Leu Leu He Gin Asn Gly Pro Ala Asn Val 
165 170 175 



Cys He Ser Phe Ser He Cys His Thr Phe Arg Trp His Glu Ala Met 
180 185 190 

Phe Leu Leu Glu Phe Leu Leu Pro Leu Gly He He Leu Phe Cys Ser 
25 195 200 205 

Ala Arg He He Trp Ser Leu Arg Gin Arg Gin Met Asp Arg His Ala 
210 215 220 



30 



Lys He Lys Arg Ala He Thr Phe lie Met Val Val Ala He Val Phe 
225 230 235 240 

Val He Cys Phe Leu Pro Ser Val Val Val Arg He Arg He Phe Trp 
245 250 255 



Leu Leu His Thr Ser Gly Thr Gin Asn Cys Glu Val Tyr Arg Ser Val 
260 265 270 

Asp Leu Ala Phe Phe He Thr Leu Ser Phe Thr Tyr Met Asn Ser Met 

35 275 280 285 



Leu Asp Pro Val Val Tyr Tyr Phe Ser Ser Pro Ser Phe Pro Asn Phe 
290 295 300 
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Phe Ser Thr Leu lie Asn Arg Cys Leu Gin Arg Lys Met Thr Gly Glu 
305 310 315 320 

Pro Asp Asn Asn Arg Ser Thr Ser Val Glu Leu Thr Gly Asp Pro Asn 
325 330 335 

5 Lys Thr Arg Gly Ala Pro Glu Ala Leu Met Ala Asn Ser Gly Glu Pro 

340 345 350 

Trp Ser Pro Ser Tyr Leu Gly Pro Thr Ser Asn Asn His Ser Lys Lys 
355 360 365 

Gly His Cys His Gin Glu Pro Ala Ser Leu Glu Lys Gin Leu Gly Cys 
10 370 375 380 



Cys lie Glu 
385 

(110) INFORMATION FOR SKQ ID NO: 109: 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
20 (iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
ACCATGGCTT GCAATGGCAG TGCGGCCAGG GGGCACT 37 
(111) INFORMATION FOR SEQ ID NO: 110: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
30 (iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:110: 
CGACCAGGAC AAACAGCATC TTGGTCACTT GTCTCCGGC 39 
(112) INFORMATION FOR SEQ ID NO: 111: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
5 (iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
GACCAAGATG CTGTTTGTCC TGGTCGTGGT GTTTGGCAT 3 g 

(113) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
15 (iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
CGGAATTCAG GATGGATCGG TCTCTTGCTG CGCCT 35 

(114) INFORMATION FOR SEQ ID NO:113: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1212 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

ATGGCTTGCA ATGGCAGTGC GGCCAGGGGG CACTTTGACC CTGAGGACTT GAACCTGACT 60 

GACGAGGCAC TGAGACTCAA GTACCTGGGG CCCCAGCAGA CAGAGCTGTT CATGCCCATC 120 

TGTGCCACAT ACCTGCTGAT CTTCGTGGTG GGCGCTGTGG GCAATGGGCT GACCTGTCTG 180 

GTCATCCTGC GCCACAAGGC CATGCGCACG CCTACCAACT ACTACCTCTT CAGCCTGGCC 240 

30 GTGTCGGACC TGCTGGTGCT GCTGGTGGGC CTGCCCCTGG AGCTCTATGA GATGTGGCAC 300 

AACTACCCCT TCCTGCTGGG CGTTGGTGGC TGCTATTTCC GCACGCTACT GTTTGAGATG 360 

GTCTGCCTGG CCTCAGTGCT CAACGTCACT GCCCTGAGCG TGGAACGCTA TGTGGCCGTG 420 

GTGCACCCAC TCCAGGCCAG GTCCATGGTG ACGCGGGCCC ATGTGCGCCG AGTGCTTGGG 480 
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GCCGTCTGGG GTCTTGCCAT GCTCTGCTCC CTGCCCAACA CCAGCCTGCA CGGCATCCGG 540 

CAGCTGCACG TGCCCTGCCG GGGCCCAGTG CCAGACTCAG CTGTTTGCAT GCTGGTCCGC 600 

CCACGGGCCC TCTACAACAT GGTAGTGCAG ACCACCGCGC TGCTCTTCTT CTGCCTGCCC 660 

ATGGCCATCA TGAGCGTGCT CTACCTGCTC ATTGGGCTGC GACTGCGGCG GGAGAGGCTG 720 

5 CTGCTCATGC AGGAGGCCAA GGGCAGGGGC TCTGCAGCAG CCAGGTCCAG ATACACCTGC 780 

AGGCTCCAGC AGCACGATCG GGGCCGGAGA CAAGTGACCA AGATGCTGTT TGTCCTGGTC 840 

GTGGTGTTTG GCATCTGCTG GGCCCCGTTC CACGCCGACC GCGTCATGTG GAGCGTCGTG 900 

TCACAGTGGA CAGATGGCCT GCACCTGGCC TTCCAGCACG TGCACGTCAT CTCCGGCATC 960 

TTCTTCTACC TGGGCTCGGC GGCCAACCCC GTGCTCTATA GCCTCATGTC CAGCCGCTTC 1020 

10 CGAGAGACCT TCCAGGAGGC CCTGTGCCTC GGGGCCTGCT GCCATCGCCT CAGACCCCGC 1080 

CACAGCTCCC ACAGCCTCAG CAGGATGACC ACAGGCAGCA CCCTGTGTGA TGTGGGCTCC 1140 

CTGGGCAGCT GGGTCCACCC CCTGGCTGGG AACGATGGCC CAGAGGCGCA GCAAGAGACC 1200 

GATCCATCCT GA 1212 
(115) INFORMATION FOR SEQ ID NO: 114: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

20 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

Met Ala Cys Asn Gly Ser Ala Ala Arg Gly His Phe Asp Pro Glu Asp 
15 10 15 

Leu Asn Leu Thr Asp Glu Ala Leu Arg Leu Lys Tyr Leu Gly Pro Gin 
25 20 25 30 

Gin Thr Glu Leu Phe Met Pro lie Cys Ala Thr Tyr Leu Leu lie Phe 
35 40 45 

Val Val Gly Ala Val Gly Asn Gly Leu Thr Cys Leu Val lie Leu Arg 
50 55 60 

30 His Lys Ala Met Arg Thr Pro Thr Asn Tyr Tyr Leu Phe Ser Leu Ala 

65 70 75 80 

Val Ser Asp Leu Leu Val Leu Leu Val Gly Leu Pro Leu Glu Leu Tyr 
85 90 95 
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Glu Met Trp His Asn Tyr Pro Phe Leu Leu Gly Val Gly Gly Cys Tyr 
100 105 no 

Phe Arg Thr Leu Leu Phe Glu Met Val Cys Leu Ala Ser Val Leu Asn 
115 120 125 

Val Thr Ala Leu Ser Val Glu Arg Tyr Val Ala Val Val His Pro Leu 
130 135 140 

Gin Ala Arg Ser Met Val Thr Arg Ala His Val Arg Arg Val Leu Gly 
145 150 155 ^ i 6 o 

Ala Val Trp Gly Leu Ala Met Leu Cys Ser Leu Pro Asn Thr Ser Leu 
165 170 175 

His Gly He Arg Gin Leu His Val Pro Cys Arg Gly Pro Val Pro Asp 
180 185 190 

Ser Ala Val Cys Met Leu Val Arg Pro Arg Ala Leu Tyr Asn Met Val 
195 200 205 

Val Gin Thr Thr Ala Leu Leu Phe Phe Cys Leu Pro Met Ala He Met 
210 215 220 

Ser Val Leu Tyr Leu Leu He Gly Leu Arg Leu Arg Arg Glu Arg Leu 
225 230 235 240 

Leu Leu Met Gin Glu Ala Lys Gly Arg Gly Ser Ala Ala Ala Arg Ser 
245 250 255 

Arg Tyr Thr Cys Arg Leu Gin Gin His Asp Arg Gly Arg Arg Gin Val 
260 265 270 

Thr Lys Met Leu Phe Val Leu Val Val Val Phe Gly He Cys Trp Ala 
275 280 285 

Pro Phe His Ala Asp Arg Val Met Trp Ser Val Val Ser Gin Trp Thr 
290 295 300 

Asp Gly Leu His Leu Ala Phe Gin His Val His Val He Ser Gly He 
305 310 315 320 

Phe Phe Tyr Leu Gly Ser Ala Ala Asn Pro Val Leu Tyr Ser Leu Met 
325 330 335 

Ser Ser Arg Phe Arg Glu Thr Phe Gin Glu Ala Leu Cys Leu Gly Ala 
340 345 350 

Cys Cys His Arg Leu Arg Pro Arg His Ser Ser His Ser Leu Ser Arg 
355 360 365 

Met Thr Thr Gly Ser Thr Leu Cys Asp Val Gly Ser Leu Gly Ser Trp 
370 375 380 

Val His Pro Leu Ala Gly Asn Asp Gly Pro Glu Ala Gin Gin Glu Thr 
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385 390 395 400 

Asp Pro Ser 

(116) INFORMATION FOR SEQ ID NO: 115: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
GGAAGCTTCA GGCCCAAAGA TGGGGAACAT 30 

(117) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:116: 

GTGGATCCAC CCGCGGAGGA CCCAGGCTAG 30 

(118) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1098 base pairs 

25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

30 ATGGGGAACA TCACTGCAGA CAACTCCTCG ATGAGCTGTA CCATCGACCA TACCATCCAC 60 

CAGACGCTGG CCCCGGTGGT CTATGTTACC GTGCTGGTGG TGGGCTTCCC GGCCAACTGC 120 

CTGTCCCTCT ACTTCGGCTA CCTGCAGATC AAGGCCCGGA ACGAGCTGGG CGTGTACCTG .180 

TGCAACCTGA CGGTGGCCGA CCTCTTCTAC ATCTGCTCGC TGCCCTTCTG GCTGCAGTAC 240 

GTGCTGCAGC ACGACAACTG GTCTCACGGC GACCTGTCCT GCCAGGTGTG CGGCATCCTC 300 



35 CTGTACGAGA ACATCTACAT CAGCGTGGGC TTCCTCTGCT GCATCTCCGT GGACCGCTAC 360 
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CTGGCTGTGG CCCATCCCTT CCGCTTCCAC CAGTTCCGGA CCCTGAAGGC GGCCGTCGGC 420 

GTCAGCGTGG TCATCTGGGC CAAGGAGCTG CTGACCAGCA TCTACTTCCT GATGCACGAG 480 

GAGGTCATCG AGGACGAGAA CCAGCACCGC GTGTGCTTTG AGCACTACCC CATCCAGGCA 540 

TGGCAGCGCG CCATCAACTA CTACCGCTTC CTGGTGGGCT TCCTCTTCCC CATCTGCCTG 600 

5 CTGCTGGCGT CCTACCAGGG CATCCTGCGC GCCGTGCGCC GGAGCCACGG CACCCAGAAG 660 

AGCCGCAAGG ACCAGATCCA GCGGCTGGTG CTCAGCACCG TGGTCATCTT CCTGGCCTGC 720 

TTCCTGCCCT ACCACGTGTT GCTGCTGGTG CGCAGCGTCT GGGAGGCCAG CTGCGACTTC 780 

GCCAAGGGCG TTTTCAACGC CTACCACTTC TCCCTCCTGC TCACCAGCTT CAACTGCGTC 840 

GCCGACCCCG TGCTCTACTG CTTCGTCAGC GAGACCACCC ACCGGGACCT GGCCCGCCTC 900 

10 CGCGGGGCCT GCCTGGCCTT CCTCACCTGC TCCAGGACCG GCCGGGCCAG GGAGGCCTAC 960 

CCGCTGGGTG CCCCCGAGGC CTCCGGGAAA AGCGGGGCCC AGGGTGAGGA GCCCGAGCTG 1020 

TTGACCAAGC TCCACCCGGC CTTCCAGACC CCTAACTCGC CAGGGTCGGG CGGGTTCCCC 1080 

ACGGGCAGGT TGGCCTAG 1098 
(119) INFORMATION FOR SEQ ID NO: 118: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

20 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

Met Gly Asn He Thr Ala Asp Asn Ser Ser Met Ser Cys Thr He Asp 
1 5 10 15 

His Thr He His Gin Thr Leu Ala Pro Val Val Tyr Val Thr Val Leu 
25 20 25 30 

Val Val Gly Phe Pro Ala Asn Cys Leu Ser Leu Tyr Phe Gly Tyr Leu 
35 40 * 45 

Gin He Lys Ala Arg Asn Glu Leu Gly Val Tyr Leu Cys Asn Leu Thr 
50 55 60 



30 



Val Ala Asp Leu Phe Tyr He Cys Ser Leu Pro Phe Trp Leu Gin Tyr 
65 70 75 80 

Val Leu Gin His Asp Asn Trp Ser His Gly Asp Leu Ser Cys Gin Val 
85 90 95 
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Cys Gly He Leu Leu Tyr Glu Asn He Tyr He Ser Val Gly Phe Leu 
100 105 110 

Cys Cys He Ser Val Asp Arg Tyr Leu Ala Val Ala His Pro Phe Arg 
115 120 125 

5 Phe His Gin Phe Arg Thr Leu Lys Ala Ala Val Gly Val Ser Val Val 

130 135 140 

He Trp Ala Lys Glu Leu Leu Thr Ser lie Tyr Phe Leu Met His Glu 
145 150 155 160 

Glu Val He Glu Asp Glu Asn Gin His Arg Val Cys Phe Glu His Tyr 
10 165 170 175 

Pro He Gin Ala Trp Gin Arg Ala He Asn Tyr Tyr Arg Phe Leu Val 
180 185 190 

Gly Phe Leu Phe Pro He Cys Leu Leu Leu Ala Ser Tyr Gin Gly He 
195 200 205 

15 Leu Arg Ala Val Arg Arg Ser His Gly Thr Gin Lys Ser Arg Lys Asp 

210 215 220 

Gin lie Gin Arg Leu Val Leu Ser Thr Val Val He Phe Leu Ala Cys 
225 230 235 240 



Phe Leu Pro Tyr His Val Leu Leu Leu Val Arg Ser Val Trp Glu Ala 
20 245 250 255 



Ser Cys Asp Phe Ala Lys Gly Val Phe Asn Ala Tyr His Phe Ser Leu 
260 265 270 



Leu Leu Thr Ser Phe Asn Cys Val Ala Asp Pro Val Leu Tyr Cys Phe 
275 280 285 

t 

25 Val Ser Glu Thr Thr His Arg Asp Leu Ala Arg Leu Arg Gly Ala Cys 

290 295 300 



Leu Ala Phe Leu Thr Cys Ser Arg Thr Gly Arg Ala Arg Glu Ala Tyr 
305 310 315 320 

Pro Leu Gly Ala Pro Glu Ala Ser Gly Lys Ser Gly Ala Gin Gly Glu 
30 325 330 335 

Glu Pro Glu Leu Leu Thr Lys Leu His Pro Ala Phe Gin Thr Pro Asn 
340 345 350 

Ser Pro Gly Ser Gly Gly Phe Pro Thr Gly Arg Leu Ala 
355 360 365 



35 (120) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

GACCTCGAGT CCTTCTACAC CTCATC 26 

(121) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 

10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 
15 TGCTCTAGAT TCCAGATAGG TGAAAACTTG 30 

(122) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1416 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

ATGGATATTC TTTGTGAAGA AAATACTTCT TTGAGCTCAA CTACGAACTC CCTAATGCAA 60 

25 TTAAATGATG ACAACAGGCT CTACAGTAAT GACTTTAACT CCGGAGAAGC TAACACTTCT 120 

GATGCATTTA ACTGGACAGT CGACTCTGAA AATCGAACCA ACCTTTCCTG TGAAGGGTGC 180 

CTCTCACCGT CGTGTCTCTC CTTACTTCAT CTCCAGGAAA AAAACTGGTC TGCTTTACTG 240 

ACAGCCGTAG TGATTATTCT AACTATTGCT GGAAACATAC TCGTCATCAT GGCAGTGTCC 300 

CTAGAGAAAA AGCTGCAGAA TGCCACCAAC TATTTCCTGA TGTCACTTGC CATAGCTGAT 360 

30 ATGCTGCTGG GTTTCCTTGT CATGCCCGTG TCCATGTTAA CCATCCTGTA TGGGTACCGG 420 

TGGCCTCTGC CGAGCAAGCT TTGTGCAGTC TGGATTTACC TGGACGTGCT CTTCTCCACG 480 

GCCTCCATCA TGCACCTCTG CGCCATCTCG CTGGACCGCT ACGTCGCCAT CCAGAATCCC 540 

ATCCACCACA GCCGCTTCAA CTCCAGAACT AAGGCATTTC TGAAAATCAT TGCTGTTTGG 600 
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ACCATATCAG TAGGTATATC CATGCCAATA CCAGTCTTTG GGCTACAGGA CGATTCGAAG 660 

GTCTTTAAGG AGGGGAGTTG CTTACTCGCC GATGATAACT TTGTCCTGAT CGGCTCTTTT 720 

GTGTCATTTT TCATTCCCTT AACCATCATG GTGATCACCT ACTTTCTAAC TATCAAGTCA 780 

CTCCAGAAAG AAGCTACTTT GTGTGTAAGT GATCTTGGCA CACGGGCCAA ATTAGCTTCT 840 

5 TTCAGCTTCC TCCCTCAGAG TTCTTTGTCT TCAGAAAAGC TCTTCCAGCG GTCGATCCAT 900 

AGGGAGCCAG GGTCCTACAC AGGCAGGAGG ACTATGCAGT CCATCAGCAA TGAGCAAAAG 960 

GCATGCAAGG TGCTGGGCAT CGTCTTCTTC CTGTTTGTGG TGATGTGGTG CCCTTTCTTC 1020 

ATCACAAACA TCATGGCCGT CATCTGCAAA GAGTCCTGCA ATGAGGATGT CATTGGGGCC 1080 

CTGCTCAATG TGTTTGTTTG GATCGGTTAT CTCTCTTCAG CAGTCAACCC ACTAGTCTAC 1140 

10 ACACTGTTCA ACAAGACCTA TAGGTCAGCC TTTTCACGGT ATATTCAGTG TCAGTACAAG 1200 

GAAAACAAAA AACCATTGCA GTTAATTTTA GTGAACACAA TACCGGCTTT GGCCTACAAG 1260 

TCTAGCCAAC TTCAAATGGG ACAAAAAAAG AATTCAAAGC AAGATGCCAA GACAACAGAT 1320 

AATGACTGCT CAATGGTTGC TCTAGGAAAG CAGTATTCTG AAGAGGCTTC TAAAGACAAT 1380 

AGCGACGGAG TGAATGAAAA GGTGAGCTGT GTGTGA 1416 

15 (123) INFORMATION FOR SEQ ID NO:122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

20 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

Met Asp lie Leu Cys Glu Glu Asn Thr Ser Leu Ser Ser Thr Thr Asn 
1 5 10 , 15 

25 Ser Leu Met Gin Leu Asn Asp Asp Asn Arg Leu Tyr Ser Asn Asp Phe 

20 25 30 

Asn Ser Gly Glu Ala Asn Thr Ser Asp Ala Phe Asn Trp Thr Val Asp 
35 40 45 

Ser Glu Asn Arg Thr Asn Leu Ser Cys Glu Gly Cys Leu Ser Pro Ser 
30 50 55 60 

Cys Leu Ser Leu Leu His Leu Gin Glu Lys Asn Trp Ser Ala Leu Leu 
65 70 75 80 
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Thr Ala Val Val He He Leu Thr He Ala Gly Asn He Leu Val He 
85 so 95 

Met Ala Val Ser Leu Glu Lys Lys Leu Gin Asn Ala Thr Asn Tyr Phe 
100 105 no 

5 Leu Met Ser Leu Ala He Ala Asp Met Leu Leu Gly Phe Leu Val Met 

115 120 125 

Pro Val Ser Met Leu Thr He Leu Tyr Gly Tyr Arg Trp Pro Leu Pro 
130 135 140 

Ser Lys Leu Cys Ala Val Trp lie Tyr Leu Asp Val Leu Phe Ser Thr 
10 145 150 155 iso 

Ala Ser He Met His Leu Cys Ala He Ser Leu Asp Arg Tyr Val Ala 
165 170 175 



15 



He Gin Asn Pro lie His His Ser Arg Phe Asn Ser Arg Thr Lys Ala 
180 185 190 

Phe Leu Lys He He Ala Val Trp Thr He Ser Val Gly He Ser Met 
195 200 205 

Pro He Pro Val Phe Gly Leu Gin Asp Asp Ser Lys Val Phe Lys Glu 
210 215 220 

Gly Ser Cys Leu Leu Ala Asp Asp Asn Phe Val Leu He Gly Ser Phe 
20 225 230 235 240 

Val Ser Phe Phe He Pro Leu Thr He Met Val He Thr Tyr Phe Leu 
245 250 255 

Thr He Lys Ser Leu Gin Lys Glu Ala Thr Leu Cys Val Ser Asp Leu 
260 265 270 

25 Gly Thr Arg Ala Lys Leu Ala Ser Phe Ser Phe Leu Pro Gin Ser Ser 

275 280 285 

Leu Ser Ser Glu Lys Leu Phe Gin Arg Ser He His Arg Glu Pro Gly 
29 0 295 300 

Ser Tyr Thr Gly Arg Arg Thr Met Gin Ser He Ser Asn Glu Gin Lys 
30 305 310 315 320 

Ala Cys Lys Val Leu Gly He Val Phe Phe Leu Phe Val Val Met Trp 
325 330 335 

Cys Pro Phe Phe He Thr Asn He Met Ala Val He Cys Lys Glu Ser 
340 345 350 



35 



Cys Asn Glu Asp Val He Gly Ala Leu Leu Asn Val Phe Val Trp He 
355 360 365 

Gly Tyr Leu Ser Ser Ala Val Asn Pro Leu Val Tyr Thr Leu Phe Asn 
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370 375 380 

Lys Thr Tyr Arg Ser Ala Phe Ser Arg Tyr He Gin Cys Gin Tyr Lys 
385 390 395 400 

Glu Asn Lys Lys Pro Leu Gin Leu He Leu Val Asn Thr He Pro Ala 
5 405 410 415 

Leu Ala Tyr Lys Ser Ser Gin Leu Gin Met Gly Gin Lys Lys Asn Ser 
420 425 430 

Lys Gin Asp Ala Lys Thr Thr Asp Asn Asp Cys Ser Met Val Ala Leu 
435 440 445 

10 Gly Lys Gin Tyr Ser Glu Glu Ala Ser Lys Asp Asn Ser Asp Gly Val 

450 455 460 

Asn Glu Lys Val Ser Cys Val 
465 470 

(124) INFORMATION FOR SEQ ID NO: 123: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 
GACCTCGAGG TTGCTTAAGA CTGAAGC 2 7 

(125) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

ATTTCTAGAC ATATGTAGCT TGTACCG 27 

(126) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1377 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

ATGGTGAACC TGAGGAATGC GGTGCATTCA TTCCTTGTGC ACCTAATTGG CCTATTGGTT 60 

TGGCAATGTG ATATTTCTGT GAGCCCAGTA GCAGCTATAG TAACTGACAT TTTCAATACC 120 

5 TCCGATGGTG GACGCTTCAA ATTCCCAGAC GGGGTACAAA ACTGGCCAGC ACTTTCAATC 180 

GTCATCATAA TAATCATGAC AATAGGTGGC AACATCCTTG TGATCATGGC AGTAAGCATG 240 

GAAAAGAAAC TGCACAATGC CACCAATTAC TTCTTAATGT CCCTAGCCAT TGCTGATATG 300 

CTAGTGGGAC TACTTGTCAT GCCCCTGTCT CTCCTGGCAA TCCTTTATGA TTATGTCTGG 360 

CCACTACCTA GATATTTGTG CCCCGTCTGG ATTTCTTTAG ATGTTTTATT TTCAACAGCG 420 

10 TCCATCATGC ACCTCTGCGC TATATCGCTG GATCGGTATG TAGCAATACG TAATCCTATT 480 

GAGCATAGCC GTTTCAATTC GCGGACTAAG GCCATCATGA AGATTGCTAT TGTTTGGGCA 540 

ATTTCTATAG GTGTATCAGT TCCTATCCCT GTGATTGGAC TGAGGGACGA AGAAAAGGTG 600 

TTCGTGAACA ACACGACGTG CGTGCTCAAC GACCCAAATT TCGTTCTTAT TGGGTCCTTC 660 

GTAGCTTTCT TCATACCGCT GACGATTATG GTGATTACGT ATTGCCTGAC CATCTACGTT 720 

15 CTGCGCCGAC AAGCTTTGAT GTTACTGCAC GGCCACACCG AGGAACCGCC TGGACTAAGT 780 

CTGGATTTCC TGAAGTGCTG CAAGAGGAAT ACGGCCGAGG AAGAGAACTC TGCAAACCCT 840 

AACCAAGACC AGAACGCACG CCGAAGAAAG AAGAAGGAGA GACGTCCTAG GGGCACCATG 900 

CAGGCTATCA ACAATGAAAG AAAAGCTTCG AAAGTCCTTG GGATTGTTTT CTTTGTGTTT 960 

CTGATCATGT GGTGCCCATT TTTCATTACC AATATTCTGT CTGTTCTTTG TGAGAAGTCC 1020 

20 TGTAACCAAA AGCTCATGGA AAAGCTTCTG AATGTGTTTG TTTGGATTGG CTATGTTTGT 1080 

TCAGGAATCA ATCCTCTGGT GTATACTCTG TTCAACAAAA TTTACCGAAG GGCATTCTCC 1140 

AACTATTTGC GTTGCAATTA TAAGGTAGAG AAAAAGCCTC CTGTCAGGCA GATTCCAAGA 1200 

GTTGCCGCCA CTGCTTTGTC TGGGAGGGAG CTTAATGTTA ACATTTATCG GCATACCAAT 1260 

GAACCGGTGA TCGAGAAAGC CAGTGACAAT GAGCCCGGTA TAGAGATGCA AGTTGAGAAT 1320 

25 TTAGAGTTAC CAGTAAATCC CTCCAGTGTG GTTAGCGAAA GGATTAGCAG TGTGTGA 1377 
(127) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

Met Val Asn Leu Arg Asn Ala Val His Ser Phe Leu Val His Leu lie 
15 10 15 

Gly. Leu Leu Val Trp Gin Cys Asp He Ser Val Ser Pro Val Ala Ala 
20 25 30 

He Val Thr Asp He Phe Asn Thr Ser Asp Gly Gly Arg Phe Lys Phe 
35 40 45 

Pro Asp Gly Val Gin Asn Trp Pro Ala Leu Ser He Val He He He 
50 55 60 

He Met Thr He Gly Gly Asn He Leu Val He Met Ala Val Ser Met 
65 70 75 80 

Glu Lys Lys Leu His Asn Ala Thr Asn Tyr Phe Leu Met Ser Leu Ala 
85 90 95 

He Ala Asp Met Leu Val Gly Leu Leu Val Met Pro Leu Ser Leu Leu 
100 105 110 

Ala He Leu Tyr Asp Tyr Val Trp Pro Leu Pro Arg Tyr Leu Cys Pro 
115 120 125 

Val Trp He Ser Leu Asp Val Leu Phe Ser Thr Ala Ser He Met His 
130 135 140 

Leu Cys Ala He Ser Leu Asp Arg Tyr Val Ala He Arg Asn Pro lie 
145 150 155 160 

Glu His Ser Arg Phe Asn Ser Arg Thr Lys Ala He Met Lys He Ala 
165 170 175 

He Val Trp Ala He Ser lie Gly Val Ser Val Pro He Pro Val He 
180 185 190 

Gly Leu Arg Asp Glu Glu Lys Val Phe Val Asn Asn Thr Thr Cys Val 
195 200 205 

Leu Asn Asp Pro Asn Phe Val Leu He Gly Ser Phe Val Ala Phe Phe 
210 215 220 

lie Pro Leu Thr lie Met Val lie Thr Tyr Cys Leu Thr lie Tyr Val 
225 230 235 240 

Leu Arg Arg Gin Ala Leu Met Leu Leu His Gly His Thr Glu Glu Pro 
245 250 255 
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Pro Gly Leu Ser Leu Asp Phe Leu Lys Cys Cys Lys Arg Asn Thr Ala 
260 265 270 

Glu Glu Glu Asn Ser Ala Asn Pro Asn Gin Asp Gin Asn Ala Arg Arg 
275 280 285 

Arg Lys Lys Lys Glu Arg Arg Pro Arg Gly Thr Met Gin Ala He Asn 
290 295 300 

Asn Glu Arg Lys Ala Ser Lys Val Leu Gly He Val Phe Phe Val Phe 
305 310 315 320 

Leu He Met Trp Cys Pro Phe Phe He Thr Asn He Leu Ser Val Leu 
325 330 335 

Cys Glu Lys Ser Cys Asn Gin Lys Leu Met Glu Lys Leu Leu Asn Val 
340 345 350 

Phe Val Trp He Gly Tyr Val Cys Ser Gly He Asn Pro Leu Val Tyr 
355 360 365 

Thr Leu Phe Asn Lys He Tyr Arg Arg Ala Phe Ser Asn Tyr Leu Arg 
370 375 380 

Cys Asn Tyr Lys Val Glu Lys Lys Pro Pro Val Arg Gin He Pro Arg 
385 390 395 4 00 

Val Ala Ala Thr Ala Leu Ser Gly Arg Glu Leu Asn Val Asn He Tyr 
405 410 415 

Arg His Thr Asn Glu Pro Val He Glu Lys Ala Ser Asp Asn Glu Pro 
420 425 430 

Gly He Glu Met Gin Val Glu Asn Leu Glu Leu Pro Val Asn Pro Ser 
435 440 445 

Ser Val Val Ser Glu 'Arg He Ser Ser Val 
450 455 

(128) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:127: 
GGTAAGCTTG GCAGTCCACG CCAGGCCTTC 30 

(129) INFORMATION FOR SEQ ID NO: 12 8: 



WO 00/22129 PCT/US99/23938 

101 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 
TCCGAATTCT CTGTAGACAC AAGGCTTTGG 30 
(130) INFORMATION FOR SEQ ID NO: 129: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1068 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

ATGGATCAGT TCCCTGAATC AGTGACAGAA AACTTTGAGT ACGATGATTT GGCTGAGGCC 60 

TGTTATATTG GGGACATCGT GGTCTTTGGG ACTGTGTTCC TGTCCATATT CTACTCCGTC 120 

ATCTTTGCCA TTGGCCTGGT GGGAAATTTG TTGGTAGTGT TTGCCCTCAC CAACAGCAAG 180 

20 AAGCCCAAGA GTGTCACCGA CATTTACCTC CTGAACCTGG CCTTGTCTGA TCTGCTGTTT 24 0 

GTAGCCACTT TGCCCTTCTG GACTCACTAT TTGATAAATG AAAAGGGCCT CCACAATGCC 300 

ATGTGCAAAT TCACTACCGC CTTCTTCTTC ATCGGCTTTT TTGGAAGCAT ATTCTTCATC 360 

ACCGTCATCA GCATTGATAG GTACCTGGCC ATCGTCCTGG CCGCCAACTC CATGAACAAC 420 

CGGACCGTGC AGCATGGCGT CACCATCAGC CTAGGCGTCT GGGCAGCAGC CATTTTGGTG 480 

25 GCAGCACCCC AGTTCATGTT CACAAAGCAG AAAGAAAATG AATGCCTTGG TGACTACCCC 540 

GAGGTCCTCC AGGAAATCTG GCCCGTGCTC CGCAATGTGG AAACAAATTT TCTTGGCTTC 600 

CTACTCCCCC TGCTCATTAT GAGTTATTGC TACTTCAGAA TCATCCAGAC GCTGTTTTCC 660 

TGCAAGAACC ACAAGAAAGC CAAAGCCATT AAACTGATCC TTCTGGTGGT CATCGTGTTT 720 

TTCCTCTTCT GGACACCCTA CAACGTTATG ATTTTCCTGG AGACGCTTAA GCTCTATGAC 780 

30 TTCTTTCCCA GTTGTGACAT GAGGAAGGAT CTGAGGCTGG CCCTCAGTGT GACTGAGACG 840 

GTTGCATTTA GCCATTGTTG CCTGAATCCT CTCATCTATG CATTTGCTGG GGAGAAGTTC 900 

AGAAGATACC TTTACCACCT GTATGGGAAA TGCCTGGCTG TCCTGTGTGG GCGCTCAGTC 960 
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CACGTTGATT TCTCCTCATC TGAATCACAA AGGAGCAGGC ATGGAAGTGT TCTGAGCAGC 1020 
AATTTTACTT ACCACACGAG TGATGGAGAT GCATTGCTCC TTCTCTGA 1068 
(131) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 355 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 0: 

Met Asp Gin Phe Pro Glu Ser Val Thr Glu Asn Phe Glu Tyr Asp Asp 
15 io 15 

Leu Ala Glu Ala Cys Tyr lie Gly Asp He Val Val Phe Gly Thr Val 
20 25 30 

Phe Leu Ser He Phe Tyr Ser Val He Phe Ala He Gly Leu Val Gly 
35 40 45 

Asn Leu Leu Val Val Phe Ala Leu Thr Asn Ser Lys Lys Pro Lys Ser 
50 55 60 

Val Thr Asp He Tyr Leu Leu Asn Leu Ala Leu Ser Asp Leu Leu Phe 
65 70 75 80 

Val Ala Thr Leu Pro Phe Trp Thr His Tyr Leu He Asn Glu Lys Gly 
85 90 95 

Leu His Asn Ala Met Cys Lys Phe Thr Thr Ala Phe Phe Phe He Gly 
100 105 no 

Phe Phe Gly Ser He Phe Phe He Thr Val He Ser He Asp Arg Tyr 
115 120 125 

Leu Ala He Val Leu Ala Ala Asn Ser Met Asn Asn Arg Thr Val Gin 
130 135 140 

His Gly Val Thr He Ser Leu Gly Val Trp Ala Ala Ala He Leu Val 
145 150 155 160 

Ala Ala Pro Gin Phe Met Phe Thr Lys Gin Lys Glu Asn Glu Cys Leu 
165 170 175 

Gly Asp Tyr Pro Glu Val Leu Gin Glu He Trp Pro Val Leu Arg Asn 
180 185 190 

Val Glu Thr Asn Phe Leu Gly Phe Leu Leu Pro Leu Leu He Met Ser 
195 200 205 
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Tyr Cys Tyr Phe Arg lie lie Gin Thr Leu Phe Ser Cys Lys Asn His 
210 215 220 

Lys Lys Ala Lys Ala lie Lys Leu lie Leu Leu Val Val lie Val Phe 
225 230 235 240 

Phe Leu Phe Trp Thr Pro Tyr Asn Val Met He Phe Leu Glu Thr Leu 
245 250 255 

Lys Leu Tyr Asp Phe Phe Pro Ser Cys Asp Met Arg Lys Asp Leu Arg 
260 265 270 

Leu Ala Leu Ser Val Thr Glu Thr Val Ala Phe Ser His Cys Cys Leu 
275 280 285 

Asn Pro Leu He Tyr Ala Phe Ala Gly Glu Lys Phe Arg Arg Tyr Leu 
290 295 300 

Tyr His Leu Tyr Gly Lys Cys Leu Ala Val Leu Cys Gly Arg Ser Val 
305 310 315 320 

His Val Asp Phe Ser Ser Ser Glu Ser Gin Arg Ser Arg His Gly Ser 
325 330 335 

Val Leu Ser Ser Asn Phe Thr Tyr His Thr Ser Asp Gly Asp Ala Leu 
340 345 350 

Leu Leu Leu 
355 

(132) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
GATCTCCAGT AGGCATAAGT GGACAATTCT GG 32 

(133) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 
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CTCCTTCGGT CCTCCTATCG TTGTCAGAAG 

(134) INFORMATION FOR SEQ ID NO:133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:133: 
AGAAGGCCAA GATCGCGCGG CTGGCCCTCA 

(135) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 
CGGCGCCACC GCACGAAAAA GCTCATCTTC 

(136) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 
GCCAAGAAGC GGGTGAAGTT CCTGGTGGTG GCA 
(137) INFORMATION FOR SEQ ID NO:136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:136: 
CAGGCGGAAG GTGAAAGTCC TGGTCCTCGT 

(138) INFORMATION FOR SEQ ID NO:137; 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

CGGCGCCTGC GGGCCAAGCG GCTGGTGGTG GTG 

(139) INFORMATION FOR SEQ ID NO:138: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 base pairs 

15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 
20 CCAAGCACAA AGCCAAGAAA GTGACCATCA C 

(140) INFORMATION FOR SEQ ID NO: 13 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
GCGCCGGCGC ACCAAATGCT TGCTGGTGGT 
30 (141) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 
CAAAAAGCTG AAGAAATCTA AGAAGATCAT CTTTATTGTC G 

(142) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 
CAAGACCAAG GCAAAACGCA TGATCGCCAT 

(143) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 
GTCAAGGAGA AGTCCAAAAG GATCATCATC 

(144) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:143: 
CGCCGCGTGC GGGCCAAGCA GCTCCTGCTC 

(145) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO.-144: 
CCTGATAAGC GCTATAAAAT GGTCCTGTTT CGA 

(146) INFORMATION FOR SEQ ID NO: 14 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
GAAAGACAAA AGAGAGTCAA GAGGATGTCT TTATTG 

(147) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 
CGGAGAAAGA GGGTGAAACG CACAGCCATC GCC 

(148) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 
AAGCTTCAGC GGGCCAAGGC ACTGGTCACC 

(149) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 
CAGCGGCAGA AGGCAAAAAG GGTGGCCATC 30 

(150) INFORMATION FOR SEQ ID NO:149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 
CGGCAGAAGG CGAAGCGCAT GATCCTCGCG 30 

(151) INFORMATION FOR SEQ ID NO:150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:150: 
GAGCGCAACA AGGCCAAAAA GGTGATCATC 30 

(152) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:151: 
GGTGTAAACA AAAAGGCTAA AAACACAATT ATTCTTATT 39 

(153) INFORMATION FOR SEQ ID NO:152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 
GAGAGCCAGC TCAAGAGCAC CGTGGTG 27 

(154) INFORMATION FOR SEQ ID NO: 153: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 
CCACAAGCAA ACCAAGAAAA TGCTGGCTGT 30 

(155) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

CATCAAGTGT ATCATGTGCC AAGTACGCCC 30 

(156) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 base pairs 

25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 
30 CTAGAGAGTC AGATGAAGTG TACAGTAGTG GCAC 34 

(157) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 
CGGACAAAAG TGAAAACTAA AAAGATGTTC CTCATT 

(158) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 
GCTGAGGTTC GCAATAAACT AACCATGTTT GTG 

(159) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 
GGGAGGCCGA GCTGAAAGCC ACCCTGCTC 

(160) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:159: 
CAAGATCAAG AGAGCCAAAA CCTTCATCAT G 

(161) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 
CCGGAGACAA GTGAAGAAGA TGCTGTTTGT C 31 

(162) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 
GCAAGGACCA GATCAAGCGG CTGGTGCTCA 3 0 

(163) INFORMATION FOR SEQ ID NO: 162: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:162: 
CAAGAAAGCC AAAGCCAAGA AACTGATCCT TCTG 34 

(164) INFORMATION FOR SEQ ID NO:163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1068 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 

ATGGAAGATT TGGAGGAAAC ATTATTTGAA GAATTTGAAA ACTATTCCTA TGACCTAGAC 60 

TATTACTCTC TGGAGTCTGA TTTGGAGGAG AAAGTCCAGC TGGGAGTTGT TCACTGGGTC 120 

TCCCTGGTGT TATATTGTTT GGCTTTTGTT CTGGGAATTC CAGGAAATGC CATCGTCATT 180 

TGGTTCACGG GGCTCAAGTG GAAGAAGACA GTCACCACTC TGTGGTTCCT CAATCTAGCC 240 

ATTGCGGATT TCATTTTTCT TCTCTTTCTG CCCCTGTACA TCTCCTATGT GGCCATGAAT 300 
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TTCCACTGGC CCTTTGGCAT CTGGCTGTGC AAAGCCAATT CCTTCACTGC CCAGTTGAAC 360 

ATGTTTGCCA GTGTTTTTTT CCTGACAGTG ATCAGCCTGG ACCACTATAT CCACTTGATC 420 

CATCCTGTCT TATCTCATCG GCATCGAACC CTCAAGAACT CTCTGATTGT CATTATATTC 480 

ATCTGGCTTT TGGCTTCTCT AATTGGCGGT CCTGCCCTGT ACTTCCGGGA CACTGTGGAG 540 

TTCAATAATC ATACTCTTTG CTATAACAAT TTTCAGAAGC ATGATCCTGA CCTCACTTTG 600 

ATCAGGCACC ATGTTCTGAC TTGGGTGAAA TTTATCATTG GCTATCTCTT CCCTTTGCTA 66 0 

ACAATGAGTA TTTGCTACTT GTGTCTCATC TTCAAGGTGA AGAAGCGAAC AGTCCTGATC 720 

TCCAGTAGGC ATAAGTGGAC AATTCTGGTT GTGGTTGTGG CCTTTGTGGT TTGCTGGACT 780 

CCTTATCACC TGTTTAGCAT TTGGGAGCTC ACCATTCACC ACAATAGCTA TTCCCACCAT 840 

GTGATGCAGG CTGGAATCCC CCTCTCCACT GGTTTGGCAT TCCTCAATAG TTGCTTGAAC 900 

CCCATCCTTT ATGTCCTAAT TAGTAAGAAG TTCCAAGCTC GCTTCCGGTC CTCAGTTGCT 960 

GAGATACTCA AGTACACACT GTGGGAAGTC AGCTGTTCTG GCACAGTGAG TGAACAGCTC 1020 

AGGAACTCAG AAACCAAGAA TCTGTGTCTC CTGGAAACAG CTCAATAA 1068 
(165) INFORMATION FOR SEQ ID NO:164: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 355 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

Met Glu Asp Leu Glu Glu Thr Leu Phe Glu Glu Phe Glu Asn Tyr Ser 
15 10 is 

Tyr Asp Leu Asp Tyr Tyr Ser Leu Glu Ser Asp Leu Glu Glu Lys Val 
20 25 30 

Gin Leu Gly Val Val His Trp Val Ser Leu Val Leu Tyr Cys Leu Ala 
35 40 45 

Phe Val Leu Gly lie Pro Gly Asn Ala He Val He Trp Phe Thr Gly 
50 55 60 

Leu Lys Trp Lys Lys Thr Val Thr Thr Leu Trp Phe Leu Asn Leu Ala 
65 70 75 80 

He Ala Asp Phe He Phe Leu Leu Phe Leu Pro Leu Tyr He Ser Tyr 
85 go 95 



WO00/22129 PCT/US99/23938 

113 

Val Ala Met Asn Phe His Trp Pro Phe Gly He Trp Leu Cys Lys Ala 
100 105 HO 

Asn Ser Phe Thr Ala Gin Leu Asn Met Phe Ala Ser Val Phe Phe Leu 
115 120 125 

Thr Val He Ser Leu Asp His Tyr He His Leu He His Pro Val Leu 
130 135 140 

Ser His Arg His Arg Thr Leu Lys Asn Ser Leu He Val He He Phe 
145 150 155 160 

He Trp Leu Leu. Ala Ser Leu He Gly Gly Pro Ala Leu Tyr Phe Arg 
165 170 175 

Asp Thr Val Glu Phe Asn Asn His Thr Leu Cys Tyr Asn Asn Phe Gin 
180 185 190 

Lys His Asp Pro Asp Leu Thr Leu He Arg His His Val Leu Thr Trp 
195 200 205 

Val Lys Phe He He Gly Tyr Leu Phe Pro Leu Leu Thr Met Ser He 
210 215 220 

Cys Tyr Leu Cys Leu He Phe Lys Val Lys Lys Arg Thr Val Leu He 
225 230 235 240 

Ser Ser Arg His Lys Trp Thr He Leu Val Val Val Val Ala Phe Val 
245 250 255 

Val Cys Trp Thr Pro Tyr His Leu Phe Ser He Trp Glu Leu Thr He 
260 265 270 

His His Asn Ser Tyr Ser His His Val Met Gin Ala Gly He Pro Leu 
275 280 285 

Ser Thr Gly Leu Ala Phe Leu Asn Ser Cys Leu Asn Pro He Leu Tyr 
290 295 300 

Val Leu He Ser Lys Lys Phe Gin Ala Arg Phe Arg Ser Ser Val Ala 
305 310 315 320 

Glu He Leu Lys Tyr Thr Leu Trp Glu Val Ser Cys Ser Gly Thr Val 
325 330 335 

Ser Glu Gin Leu Arg Asn Ser Glu Thr Lys Asn Leu Cys Leu Leu Glu 
340 345 350 

Thr Ala Gin 
355 

(166) INFORMATION FOR SEQ ID NO: 165 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1089 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:165: 

ATGGGCAACC ACACGTGGGA GGGCTGCCAC GTGGACTCGC GCGTGGACCA CCTCTTTCCG 60 

CCATCCCTCT ACATCTTTGT CATCGGCGTG GGGCTGCCCA CCAACTGCCT GGCTCTGTGG 120 

GCGGCCTACC GCCAGGTGCA ACAGCGCAAC GAGCTGGGCG TCTACCTGAT GAACCTCAGC 180 

ATCGCCGACC TGCTGTACAT CTGCACGCTG CCGCTGTGGG TGGACTACTT CCTGCACCAC 240 

10 GACAACTGGA TCCACGGCCC CGGGTCCTGC AAGCTCTTTG GGTTCATCTT CTACACCAAT 300 

ATCTACATCA GCATCGCCTT CCTGTGCTGC ATCTCGGTGG ACCGCTACCT GGCTGTGGCC 360 

CACCCACTCC GCTTCGCCCG CCTGCGCCGC GTCAAGACCG CCGTGGCCGT GAGCTCCGTG 420 

GTCTGGGCCA CGGAGCTGGG CGCCAACTCG GCGCCCCTGT TCCATGACGA GCTCTTCCGA 480 

GACCGCTACA ACCACACCTT CTGCTTTGAG AAGTTCCCCA TGGAAGGCTG GGTGGCCTGG 540 

15 ATGAACCTCT ATCGGGTGTT CGTGGGCTTC CTCTTCCCGT GGGCGCTCAT GCTGCTGTCG 600 

TACCGGGGCA TCCTGCGGGC CGTGCGGGGC AGCGTGTCCA CCGAGCGCCA GGAGAAGGCC 660 

AAGATCGCGC GGCTGGCCCT CAGCCTCATC GCCATCGTGC TGGTCTGCTT TGCGCCCTAT 720 

CACGTGCTCT TGCTGTCCCG CAGCGCCATC TACCTGGGCC GCCCCTGGGA CTGCGGCTTC 780 

GAGGAGCGCG TCTTTTCTGC ATACCACAGC TCACTGGCTT TCACCAGCCT CAACTGTGTG 840 

20 GCGGACCCCA TCCTCTACTG CCTGGTCAAC GAGGGCGCCC GCAGCGATGT GGCCAAGGCC 900 

CTGCACAACC TGCTCCGCTT TCTGGCCAGC GACAAGCCCC AGGAGATGGC CAATGCCTCG 96 0 

CTCACCCTGG AGACCCCACT CACCTCCAAG AGGAACAGCA CAGCCAAAGC CATGACTGGC 1020 

AGCTGGGCGG CCACTCCGCC TTCCCAGGGG GACCAGGTGC AGCTGAAGAT GCTGCCGCCA 1080 

GCACAATGA 1Q89 

25 (167) INFORMATION FOR SEQ ID NO:166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

30 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

Met Gly Asn His Thr Trp Glu Gly Cys His Val Asp Ser Arg Val Asp 
15 10 15 

His Leu Phe Pro Pro Ser Leu Tyr lie Phe Val lie Gly Val Gly Leu 
20 25 30 

Pro Thr Asn Cys Leu Ala Leu Trp Ala Ala Tyr Arg Gin Val Gin Gin 
35 40 45 

Arg Asn Glu Leu Gly Val Tyr Leu Met Asn Leu Ser lie Ala Asp Leu 
50 55 • 60 

Leu Tyr lie Cys Thr Leu Pro Leu Trp Val Asp Tyr Phe Leu His His 
65 70 75 80 

Asp Asn Trp lie His Gly Pro Gly Ser Cys Lys Leu Phe Gly Phe lie 
85 90 95 

Phe Tyr Thr Asn He Tyr He Ser He Ala Phe Leu Cys Cys He Ser 
100 105 110 

Val Asp Arg Tyr Leu Ala Val Ala His Pro Leu Arg Phe Ala Arg Leu 
115 120 125 

Arg Arg Val Lys Thr Ala Val Ala Val Ser Ser Val Val Trp Ala Thr 
130 135 140 

Glu Leu Gly Ala Asn Ser Ala Pro Leu Phe His Asp Glu Leu Phe Arg 
145 150 155 160 

Asp Arg Tyr Asn His Thr Phe Cys Phe Glu Lys Phe Pro Met Glu Gly 
165 170 175 

Trp Val Ala Trp Met Asn Leu Tyr Arg Val Phe Val Gly Phe Leu Phe 
180 185 190 

Pro Trp Ala Leu Met Leu Leu Ser Tyr Arg Gly He Leu Arg Ala Val 
195 200 205 

Arg Gly Ser Val Ser Thr Glu Arg Gin Glu Lys Ala Lys He Ala Arg 
210 215 220 

Leu Ala Leu Ser Leu He Ala He Val Leu Val Cys Phe Ala Pro Tyr 
225 230 235 240 

His Val Leu Leu Leu Ser Arg Ser Ala He Tyr Leu Gly Arg Pro Trp 
245 250 255 

Asp Cys Gly Phe Glu Glu Arg Val Phe Ser Ala Tyr His Ser Ser Leu 
260 265 270 



Ala Phe Thr Ser Leu Asn Cys Val Ala Asp Pro He Leu Tyr Cys Leu 
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275 280 285 

Val Asn Glu Gly Ala Arg Ser Asp Val Ala Lys Ala Leu His Asn Leu 
290 295 300 

Leu Arg Phe Leu Ala Ser Asp Lys Pro Gin Glu Met Ala Asn Ala Ser 
305 310 315 320 

Leu Thr Leu Glu Thr Pro Leu Thr Ser Lys Arg Asn Ser Thr Ala Lys 
325 330 335 

Ala Met Thr Gly Ser Trp Ala Ala Thr Pro Pro Ser Gin Gly Asp Gin 
340 345 350 

Val Gin Leu Lys Met Leu Pro Pro Ala Gin 
355 360 

(168) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1002 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

ATGGAGTCCT CAGGCAACCC AGAGAGCACC ACCTTTTTTT ACTATGACCT TCAGAGCCAG 60 

CCGTGTGAGA ACCAGGCCTG GGTCTTTGCT ACCCTCGCCA CCACTGTCCT GTACTGCCTG 12 0 

GTGTTTCTCC TCAGCCTAGT GGGCAACAGC CTGGTCCTGT GGGTCCTGGT GAAGTATGAG 180 

AGCCTGGAGT CCCTCACCAA CATCTTCATC CTCAACCTGT GCCTCTCAGA CCTGGTGTTC 240 

GCCTGCTTGT TGCCTGTGTG GATCTCCCCA TACCACTGGG GCTGGGTGCT GGGAGACTTC 3 00 

CTCTGCAAAC TCCTCAATAT GATCTTCTCC ATCAGCCTCT ACAGCAGCAT CTTCTTCCTG 360 

ACCATCATGA CCATCCACCG CTACCTGTCG GTAGTGAGCC CCCTCTCCAC CCTGCGCGTC 420 

CCCACCCTCC GCTGCCGGGT GCTGGTGACC ATGGCTGTGT GGGTAGCCAG CATCCTGTCC 480 

TCCATCCTCG ACACCATCTT CCACAAGGTG CTTTCTTCGG GCTGTGATTA TTCCGAACTC 540 

ACGTGGTACC TCACCTCCGT CTACCAGCAC AACCTCTTCT TCCTGCTGTC CCTGGGGATT 600 

ATCCTGTTCT GCTACGTGGA GATCCTCAGG ACCCTGTTCC GCTCACGCTC CAAGCGGCGC 660 

CACCGCACGA AAAAGCTCAT CTTCGCCATC GTGGTGGCCT ACTTCCTCAG CTGGGGTCCC 720 

TACAACTTCA CCCTGTTTCT GCAGACGCTG TTTCGGACCC AGATCATCCG GAGCTGCGAG 780 
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GCCAAACAGC AGCTAGAATA CGCCCTGCTC ATCTGCCGCA ACCTCGCCTT CTCCCACTGC 840 
TGCTTTAACC CGGTGCTCTA TGTCTTCGTG GGGGTCAAGT TCCGCACACA CCTGAAACAT 900 
GTTCTCCGGC AGTTCTGGTT CTGCCGGCTG CAGGCACCCA GCCCAGCCTC GATCCCCCAC 960 
TCCCCTGGTG CCTTCGCCTA TGAGGGCGCC TCCTTCTACT GA 1002 
(169) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 

Met Glu Ser Ser Gly Asn Pro Glu Ser Thr Thr Phe Phe Tyr Tyr Asp 
15 10 15 

Leu Gin Ser Gin Pro Cys Glu Asn Gin Ala Trp Val Phe Ala Thr Leu 
20 25 30 

Ala Thr Thr Val Leu Tyr Cys Leu Val Phe Leu Leu Ser Leu Val Gly 
35 40 45 

Asn Ser Leu Val Leu Trp Val Leu Val Lys Tyr Glu Ser Leu Glu Ser 
50 55 60 

Leu Thr Asn lie Phe lie Leu Asn Leu Cys Leu Ser Asp Leu Val Phe 
65 70 75 80 

Ala Cys Leu Leu Pro Val Trp lie Ser Pro Tyr His Trp Gly Trp Val 
85 90 95 

Leu Gly Asp Phe Leu Cys Lys Leu Leu Asn Met lie Phe Ser lie Ser 
100 105 110 

Leu Tyr Ser Ser lie Phe Phe Leu Thr He Met Thr He His Arg Tyr 
115 120 125 

Leu Ser Val Val Ser Pro Leu Ser Thr Leu Arg Val Pro Thr Leu Arg 
130 135 140 

Cys Arg Val Leu Val Thr Met Ala Val Trp Val Ala Ser He Leu Ser 
145 ' 150 155 160 

Ser He Leu Asp Thr He Phe His Lys Val Leu Ser Ser Gly Cys Asp 
165 170 175 

Tyr Ser Glu Leu Thr Trp Tyr Leu Thr Ser Val Tyr Gin His Asn Leu 
180 185 190 
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Phe Phe Leu Leu Ser Leu Gly lie lie Leu Phe Cys Tyr Val Glu lie 
195 200 205 

Leu Arg Thr Leu Phe Arg Ser Arg Ser Lys Arg Arg His Arg Thr Lys 
210 215 220 

5 Lys Leu lie Phe Ala lie Val Val Ala Tyr Phe Leu Ser Trp Gly Pro 

225 230 235 240 

Tyr Asn Phe Thr Leu Phe Leu Gin Thr Leu Phe Arg Thr Gin lie lie 
245 250 255 

Arg Ser Cys Glu Ala Lys Gin Gin Leu Glu Tyr Ala Leu Leu lie Cys 
10 260 265 270 

Arg Asn Leu Ala Phe Ser His Cys Cys Phe Asn Pro Val Leu Tyr Val 
275 280 285 

Phe Val Gly Val Lys Phe Arg Thr His Leu Lys His Val Leu Arg Gin 
290 295 300 

15 Phe Trp Phe Cys Arg Leu Gin Ala Pro Ser Pro Ala Ser lie Pro His 

305 310 315 320 

Ser Pro Gly Ala Phe Ala Tyr Glu Gly Ala Ser Phe Tyr 
325 330 

(170) INFORMATION FOR SEQ ID NO: 169 : 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 987 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 9: 

ATGGACAACG CCTCGTTCTC GGAGCCCTGG CCCGCCAACG CATCGGGCCC GGACCCGGCG 60 

CTGAGCTGCT CCAACGCGTC GACTCTGGCG CCGCTGCCGG CGCCGCTGGC GGTGGCTGTA 120 

CCAGTTGTCT ACGCGGTGAT CTGCGCCGTG GGTCTGGCGG GCAACTCCGC CGTGCTGTAC 180 

30 GTGTTGCTGC GGGCGCCCCG CATGAAGACC GTCACCAACC TGTTCATCCT CAACCTGGCC 240 

ATCGCCGACG AGCTCTTCAC GCTGGTGCTG CCCATCAACA TCGCCGACTT CCTGCTGCGG 300 

CAGTGGCCCT TCGGGGAGCT CATGTGCAAG CTCATCGTGG CTATCGACCA GTACAACACC 360 

TTCTCCAGCC TCTACTTCCT CACCGTCATG AGCGCCGACC GCTACCTGGT GGTGTTGGCC 420 

ACTGCGGAGT CGCGCCGGGT GGCCGGCCGC ACCTACAGCG CCGCGCGCGC GGTGAGCCTG 480 
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GCCGTGTGGG GGATCGTCAC ACTCGTCGTG CTGCCCTTCG CAGTCTTCGC CCGGCTAGAC 540 

GACGAGCAGG GCCGGCGCCA GTGCGTGCTA GTCTTTCCGC AGCCCGAGGC CTTCTGGTGG 600 

CGCGCGAGCC GCCTCTACAC GCTCGTGCTG GGCTTCGCCA TCCCCGTGTC CACCATCTGT 660 

GTCCTCTATA CCACCCTGCT GTGCCGGCTG CATGCCATGC GGCTGGACAG CCACGCCAAG 720 

5 GCCCTGGAGC GCGCCAAGAA GCGGGTGAAG TTCCTGGTGG TGGCAATCCT GGCGGTGTGC 780 

CTCCTCTGCT GGACGCCCTA CCACCTGAGC ACCGTGGTGG CGCTCACCAC CGACCTCCCG 840 

CAGACGCCGC TGGTCATCGC TATCTCCTAC TTCATCACCA GCCTGACGTA CGCCAACAGC 900 

TGCCTCAACC CCTTCCTCTA CGCCTTCCTG GACGCCAGCT TCCGCAGGAA CCTCCGCCAG 960 

CTGATAACTT GCCGCGCGGC AGCCTGA 987 
10 (171) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

15 * (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

Met Asp Asn Ala Ser Phe Ser Glu Pro Trp Pro Ala Asn Ala Ser Gly 
1 5 10 15 

20 Pro Asp Pro Ala Leu Ser Cys Ser Asn Ala Ser Thr Leu Ala Pro Leu 

20 25 30 

Pro Ala Pro Leu Ala Val Ala Val Pro Val Val Tyr Ala Val He Cys 
35 40 45 

Ala Val Gly Leu Ala Gly Asn Ser Ala Val Leu Tyr Val Leu Leu Arg 
25 50 55 60 

Ala Pro Arg Met Lys Thr Val Thr Asn Leu Phe He Leu Asn Leu Ala 
65 70 75 80 

He Ala Asp Glu Leu Phe Thr Leu Val Leu Pro He Asn He Ala Asp 
85 90 95 

30 Phe Leu Leu Arg Gin Trp Pro Phe Gly Glu Leu Met Cys Lys Leu He 

100 105 " 110 

Val Ala He Asp Gin Tyr Asn Thr Phe Ser Ser Leu Tyr Phe Leu Thr 
115 120 125 

Val Met Ser Ala Asp Arg Tyr Leu Val Val Leu Ala Thr Ala Glu Ser 
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120 

130 135 140 

Arg Arg Val Ala Gly Arg Thr Tyr Ser Ala Ala Arg Ala Val Ser Leu 
145 150 155 160 

Ala Val Trp Gly lie Val Thr Leu Val Val Leu Pro Phe Ala Val Phe 
165 170 175 

Ala Arg Leu Asp Asp Glu Gin Gly Arg Arg Gin Cys Val Leu Val Phe 
180 185 190 

Pro Gin Pro Glu Ala Phe Trp Trp Arg Ala Ser Arg Leu Tyr Thr Leu 
195 200 205 

Val Leu Gly Phe Ala He Pro Val Ser Thr He Cys Val Leu Tyr Thr 
210 215 220 

Thr Leu Leu Cys Arg Leu His Ala Met Arg Leu Asp Ser His Ala Lys 
225 230 235 2 40 

Ala Leu Glu Arg Ala Lys Lys Arg Val Lys Phe Leu Val Val Ala He 
245 250 255 

Leu Ala Val Cys Leu Leu Cys Trp Thr Pro Tyr His Leu Ser Thr Val 
260 265 270 

Val Ala Leu Thr Thr Asp Leu Pro Gin Thr Pro Leu Val He Ala He 
275 280 285 

Ser Tyr Phe He Thr Ser Leu Thr Tyr Ala Asn Ser Cys Leu Asn Pro 
290 295 300 

Phe Leu Tyr Ala Phe Leu Asp Ala Ser Phe Arg Arg Asn Leu Arg Gin 
305 310 315 320 

Leu He Thr Cys Arg Ala Ala Ala 
25 325 

(172) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1002 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

ATGCAGGCCG CTGGGCACCC AGAGCCCCTT GACAGCAGGG GCTCCTTCTC CCTCCCCACG 60 

35 ATGGGTGCCA ACGTCTCTCA GGACAATGGC ACTGGCCACA ATGCCACCTT CTCCGAGCCA 120 

• CTGCCGTTCC TCTATGTGCT CCTGCCCGCC GTGTACTCCG GGATCTGTGC TGTGGGGCTG 180 



20 



WO 00/22129 



PCT/US99/23938 



121 



ACTGGCAACA CGGCCGTCAT CCTTGTAATC CTAAGGGCGC CCAAGATGAA GACGGTGACC 
AACGTGTTCA TCCTGAACCT GGCCGTCGCC GACGGGCTCT TCACGCTGGT ACTGCCTGTC 
AACATCGCGG AGCACCTGCT GCAGTACTGG CCCTTCGGGG AGCTGCTCTG CAAGCTGGTG 
CTGGCCGTCG ACCACTACAA CATCTTCTCC AGCATCTACT TCCTAGCCGT GATGAGCGTG 
5 GACCGATACC TGGTGGTGCT GGCCACCGTG AGGTCCCGCC ACATGCCCTG GCGCACCTAC 
CGGGGGGCGA AGGTCGCCAG CCTGTGTGTC TGGCTGGGCG TCACGGTCCT GGTTCTGCCC 
TTCTTCTCTT TCGCTGGCGT CTACAGCAAC GAGCTGCAGG TCCCAAGCTG TGGGCTGAGC 
TTCCCGTGGC CCGAGCAGGT CTGGTTCAAG GCCAGCCGTG TCTACACGTT GGTCCTGGGC 
TTCGTGCTGC CCGTGTGCAC CATCTGTGTG CTCTACACAG ACCTCCTGCG CAGGCTGCGG 

10 GCCGTGCGGC TCCGCTCTGG AGCCAAGGCT CTAGGCAAGG CCAGGCGGAA GGTGAAAGTC 
CTGGTCCTCG TCGTGCTGGC CGTGTGCCTC CTCTGCTGGA CGCCCTTCCA CCTGGCCTCT 
GTCGTGGCCC TGACCACGGA CCTGCCCCAG ACCCCACTGG TCATCAGTAT GTCCTACGTC 
ATCACCAGCC TCACGTACGC CAACTCGTGC CTGAACCCCT TCCTCTACGC CTTTCTAGAT 
GACAACTTCC GGAAGAACTT CCGCAGCATA TTGCGGTGCT GA 

15 (173) INFORMATION FOR SEQ ID NO: 172: 



240 
300 
3C0 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1002 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 



Met Gin Ala Ala Gly His Pro Glu Pro Leu Asp Ser Arg Gly Ser Phe 
1 5 10 15 



25 



Ser Leu Pro Thr Met Gly Ala Asn Val Ser Gin Asp Asn Gly Thr Gly 
20 25 30 



His Asn Ala Thr Phe Ser Glu Pro Leu Pro Phe Leu Tyr Val Leu Leu 
35 40 45 



30 



Pro Ala Val Tyr Ser Gly He Cys Ala Val Gly Leu Thr Gly Asn Thr 
50 55 60 



Ala Val He Leu Val He Leu Arg Ala Pro Lys Met Lys Thr Val Thr 
65 70 75 80 



